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Chapter  II  of  this  dissertation  provides  an 
introduction  to  the  concepts  of  modern  control  theory. 

We  develop  a  classical  model  of  the  firm  and  demonstrate 
how  modern  control  theory  techniques  are  applicable  to  the 
dynamic  optimization  problem  of  the  firm.  The  transition 
from  static  optimization  to  dynamic  optimization  theory  is 
accomplished  by  reviewing  the  discrete  time  minimum  prin¬ 
ciple  and  applying  this  principle  to  the  classical  problem 
of  profit  maximization. 

Chapter  III  introduces  the  concept  of  linear  quad¬ 
ratic  control  and  develops  a  decentralized  model  of  the 
firm.  We  develop  the  concepts  of  decentralized  decision 
making  and  decentralized  information  availability  to 
formulate  a  well  posed  problem  of  decentralized  control. 
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The  solution  of  the  decentralized  control  problem 
is  presented  in  Chapter  IV  where  we  derive  an  optimal 
decentralized  control  policy  for  a  general  organizational 
team  using  the  mathematical  approach  of  dynamic  program¬ 
ming  and  the  results  of  team  theory.  This  policy  is  then 
applied  to  our  research  problem  to  determine  the  effect 
of  transfer  pricing  policy  on  the  decentralized  decision 
maker's  actions.  The  main  finding  of  this  study  under  the 
stated  conditions  of  the  analysis  is  that  the  transfer 
price  involved  in  the  interdivisional  exchange  of  goods 
or  services  does  not  affect  the  decentralized  team  deci¬ 
sion  maker's  actions. 
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CHAPTER  I 


INTRODUCTION  AND  BACKGROUND 

Introduction 

The  rapid  rate  of  technological  advancements  and 
a  steadily  increasing  growth  in  the  size  of  business 
organizations  have  resulted  in  a  trend  toward  division¬ 
alization  of  these  organizations  in  recent  years.  Divi¬ 
sionalization  leads  to  the  encouragement  of  creative 
talents  of  responsive  individuals,  a  readily  available 
measure  of  segment  success  in  the  form  of  profit  contribu¬ 
tion  and  the  improvement  of  management  training  (Solomons: 
1965).  Decentralized  decision  making  results  in  separate 
divisions  which  are  essentially  autonomous  profit  centers. 
In  a  world  of  certainty  the  objectives  of  decentraliza¬ 
tion  in  business  organizations  through  the  creation  of 
profit  centers  are  likely  to  be  achieved  when  the  managers 
of  various  profit  centers,  acting  in  their  own  self- 
interest,  also  maximize  central  management's  preferences. 
This  idea  holds  trivially  when  there  are  no  interactions 
between  the  various  decision  centers;  i.e.,  no  flow  of 
goods  or  services  between  decision  centers  and  no  cost  or 
demand  interdependence  (Kanodia : 1979) .  However,  these 
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divisions  are  frequently  faced  with  the  problem  of 
pricing  goods  and  services  that  they  exchange  with  each 
other.  Internal  transfer  prices  for  these  goods  and  ser¬ 
vices  are  historically  derived  from  a  corporate  head¬ 
quarters  or  central  coordination  agency  or  through  divi¬ 
sional  negotiation.  The  problem  of  establishing  these 
prices  is  important  since  they  affect  divisional  goal  con 
gruence,  individual  incentive  and  autonomy  of  decision 
making  (Horngren:1977) . 

Statement  of  the  Problem 
Classical  microeconomic  theory  assumes  that  the 
centralized  decision  maker  has  perfect  knowledge  of  all 
the  information  required  for  decision  making.  However, 
we  observe  that  actual  decisions  are  based  on  information 
that  is  both  incomplete  and  imperfect  due  to  the  uncer¬ 
tain  environment  that  the  firm  must  operate  within.  Yet 
the  formal  admission  of  uncertainty  has  not  been  acknowl¬ 
edged  in  most  of  the  transfer  pricing  literature  to  date 
(Demski : 1975) .  This  admission  is  important  because  the 
results  obtained  under  subjective  certainty  do  not  neces¬ 
sarily  extend  to  an  uncertain  setting.  In  addition, 
questions  relating  to  information  discrepancies,  communi¬ 
cation  strategies  and  risk  sharing  only  arise  in  an  uncer 
tain  setting. 


3 


Another  premise  of  classical  economic  theory  is 
that  the  firm  operates  in  a  static  environment  where  cur¬ 
rent  decisions  do  not  impact  future  periods  of  operation. 
In  reality,  the  decision  maker  must  base  these  decisions 
on  a  changing  environment  where  the  information  concern¬ 
ing  the  environment  is  also  changing  over  time.  A  dynamic 
analysis  of  this  information  structure  has  not  been 
addressed  in  the  transfer  pricing  literature  to  date. 
Furthermore,  the  complex  problem  of  decentralized  deci¬ 
sion  making  under  conditions  of  limited  informational 
structures  in  an  uncertain  environment  has  not  been  ack¬ 
nowledged  in  the  literature  to  date.1 

The  problem  of  establishing  an  optimal  (in  some 
sense)  transfer  pricing  policy  for  a  decentralized  firm 
in  an  uncertain,  dynamic  environment  provides  the  impetus 
for  this  dissertation.  In  the  above  sentence,  the  word 
"optimal"  refers  to  the  development  of  a  control  policy 
which  will  encourage  the  maximization  of  a  predetermined 
measure  of  profit.  The  purpose  of  the  control  system  is 
not  to  explicitly  develop  organizational  planning  objec¬ 
tives,  although  the  feedback  received  from  a  viable  con¬ 
trol  system  frequently  results  in  planning  adjustments. 
This  type  of  closed  loop  system  meshes  the  functions  of 

important  exceptions  are  Arrow  (1964)  ,  Groves 
(1973) ,  Marschak  (1959)  and  Wilson  (1968) . 
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management  planning  and  control  in  actual  operating  cir¬ 
cumstances;  however,  for  the  purpose  of  this  dissertation, 
it  is  assumed  that  optimal  control  refers  to  the  actions 
required  to  optimize  predetermined  planning  objectives. 

Background 

In  an  organization,  individuals  normally  differ 
in  at  least  three  important  aspects;  (1)  they  control 
different  action  variables;  (2)  they  base  their  decisions 
on  different  information;  and  (3)  they  have  different 
preferences;  i.e. ,  tastes  and  beliefs.  A  normative 
analysis  of  organizations  could  thus  be  suitably  modelled 
as  a  methematical  game  theory  problem  (Radner: 1972a) . 
However,  many  interesting  aspects  of  organizations  are 
related  to  differences  of  types  (1)  and  (2)  only.  Further¬ 
more,  in  some  cases  the  members  of  an  organization  may 
have  nearly  identical  preferences.  Finally,  in  the  pres¬ 
ent  state  of  development,  the  theory  of  games  of  more  than 
two  persons  does  not  appear  to  provide  many  clues  as  to 
how  to  proceed  in  a  general  analysis  of  organizations 
(Radner ; 1972b) .  This  suggests  the  study  of  theoretical 
organizations  in  which  differences  of  type  (3)  are  absent; 
that  is,  in  which  preference  differences  are  neglected 
and  a  single  payoff  function  reflects  the  common  goals  of 
the  members.  Jacob  Marschak  (1955)  has  termed  such  an 
organization  a  team.  In  the  theory  of  teams  two  basic 


questions  are  investigated:  (1)  for  a  given  information 
structure,  what  is  the  optimal  decision  function;  and  (2) 
what  are  the  relative  values  of  alternative  information 
structures . 

The  impact  of  a  transfer  pricing  change  on  the 
actions  of  decentralized  decision  makers  has  not  been 
investigated  in  a  team  setting.  This  impact  may  not  be 
trivial  in  that  it  is  not  clear  exactly  how  decentralized 
decision  makers  evaluate  available  information.  This  dis¬ 
sertation  research  will  attempt  to  develop  a  decentral¬ 
ized  control  model  of  the  firm  that  will  enable  us  to 
evaluate  the  impact  of  various  information  patterns,  to 
include  the  transfer  price,  with  respect  to  the  actions 
taken  by  the  decentralized  decision  makers.  The  disserta¬ 
tion  will  employ  a  team  theoretic  approach  to  the  decen¬ 
tralized  control  problem  which  will  allow  investigation 
of  the  impact  of  a  change  in  transfer  pricing  policy  on 
the  decision  maker's  actions.  Most  of  the  team  theory 
literature  has  not  addressed  the  team  problem  in  a  dynamic 
environment;  however,  current  research  in  the  fields  of 
economics  and  engineering  has  begun  to  deal  effectively 
with  dynamic  models  through  the  use  of  modern  control 
theory  analysis. 

The  mathematical  foundations  of  certain  parts  of 
modern  control  theory  can  be  traced  back  to  works  that 
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were  completed  some  seventy  years  ago.  For  instance,  the 
state  variable  approach  to  linear  systems  is  well  known 
to  mathematicians  as  the  theory  of  first-order  linear 
differential  equation  solutions.  State  space  concepts, 
fundamental  to  modern  control  theory,  evolved  from  the 
classical  theory  of  dynamics  of  particles  and  rigid 
bodies,  referred  to  as  phase-space  (Fuller: 1960) .  One  of 
the  significant  aspects  of  modern  control  theory  is  that 
it  is  useful  in  the  analysis  of  multivariate,  stochastic, 
dynamic  systems.  Until  recently,  a  modern  control  theo¬ 
retic  approach  was  limited  to  engineering  problems  deal¬ 
ing  with  the  physical  sciences.  This  approach  has  now 
received  attention  in  various  fields  of  the  social 
sciences,  particularly  in  economic  research.  The  appar¬ 
ent  widespread  use  of  modern  control  theory  techniques  to 
economic  research  can  be  summarized  as  follows: 

Control  theory  methods  are  used  to  find  the  opti¬ 
mal  set  of  policies  over  time  to  direct  a  determinis¬ 
tic  system  or  stochastic  system  from  given  initial 
conditions  to  desired  terminal  conditions.  Since  a 
large  number  of  economic  problems  are  naturally 
described  as  dynamic  systems  which  can  be  influenced 
by  policies  in  an  attempt  to  improve  their  performance, 
control  theory  has  gained  widespread  application  by 
economists.  (Kendrick:1980) 


Objective  of  Research 

The  objective  of  this  dissertation  is  to  develop 
a  conceptual  framework  for  the  analysis  and  control  of 
decentralized  decision  making  for  a  firm  operating  in  a 


dynamic  environment  under  conditions  of  uncertainty.  The 
framework  will  incorporate  a  modern  control  theoretic 
mathematical  structure  and  employ  a  team  theoretic 
approach  to  the  analysis  of  organizational  behavior. 

Thus  the  conceptual  framework  will  attempt  to  embody  the 
economic  concepts  of  team  theory  with  the  mathematical 
concepts  of  modern  control  theory  to  analyze  the  dynamic 
problem  of  optimal  information  structure  and  transfer 
pricing  policy  which  is  of  interest  to  the  accountant. 


Past  Approaches  and  the  Approach 
of  This  Study 

Early  accounting  research  concentrated  on  a  prag- 

2 

matic  approach  to  the  transfer  pricing  problem.  Classi¬ 
cal  economic  analysis  of  the  transfer  pricing  problem  was 
conducted  by  Hirshleifer  (1956)  and  his  paper  is  the 
definitive  reference  in  the  literature.  However,  Hirsh- 
leifer's  procedure  requires  complete  knowledge  of  market 
situations  and  complete  communication  between  decision 
makers;  conditions  that  rarely,  if  ever,  exist  in  the  cur¬ 
rent  business  environment. 

Mathematical  programming  approaches  to  the  estab¬ 
lishment  of  transfer  prices  were  presented  by  Baumol  and 
Fabian  (1964),  Jennergren  (1972)  and  Bailey  and  Boe  (1976). 

2 

See,  for  example,  Cook  (1955),  Dean  (1955), 

Dearden  (1964)  and  Stone  (1956) . 
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These  approaches  require  time-consuming  iterations  of 
information  exchanges  that  are  based  on  sensitive  opti¬ 
mality  assumptions.  The  transfer  pricing  literature 
was  surveyed  by  Abdel-Khalik  and  Lusk  (1974)  and  they  con¬ 
clude  that  the  above  approaches  have  produced  more  ques¬ 
tions  than  answers. 

The  approach  of  this  dissertation  acknowledges 
the  uncertain  and  dynamic  environment  that  exists  in  a 
modern  decentralized  organization  and  employs  a  modern 
control  theoretic  team  approach  to  decentralized  decision 
making.  Our  conceptual  framework  operates  in  a  dynamic 
environment,  incorporates  conditions  of  uncertainty, 
allows  for  multiple  information  structures  and  addresses 
the  issue  of  pricing  interdivisional  transfers  of  goods 
and  services.  Decision  making  for  the  divisions  resides 
with  the  respective  division  managers  (decentralized 
decision  making) .  Decentralized  decision  making  refers 
to  the  following:  given  m  decisions  or  actions  to  be  made 
by  n  decision  makers  (l<n<m) ,  each  decision  maker  is 
assigned  a  subset  of  the  m  decisions.  For  the  overall 
system  there  is  a  given  criterion  function  and  a  space 
of  possible  choices  involving  the  m  decisions.  Each  deci¬ 
sion  maker  is  assigned  a  space  of  possible  choices  and 
a  criterion  function  involving  at  least  the  decision 
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variables  which  he  can  partially  or  totally  control 
(Whinston:1964) . 

The  information  set  available  to  each  decision 
maker  may  also  vary.  Suppose  that  the  ith  person 
observes  a  random  variable  y^  (x)  and  takes  action  a^. 

If  there  is  no  communication  among  the  persons,  then 
person  i's  information  function  is  defined  as  I^(x)  = 
y^(x).  However,  if  there  is  complete  communication  among 
all  n  persons,  then  I^(x)  =  Y (x)  =  (y^ (x) ,  y2 (x) , . . . , 
yn(x)).  Rarely  does  one  encounter  these  two  extremes  of 
no  communication  or  complete  information  in  a  real  organi¬ 
zation.  Rather,  we  find  that  numerous  devices  are  used 
to  bring  about  a  partial  exchange  of  information  (Radner: 
1961)  . 

Overview  of  Contents  of 
the  Dissertation 

Chapter  II  of  this  dissertation  provides  an  intro¬ 
duction  to  the  concepts  of  modern  control  theory.  We 
develop  a  classical  model  of  the  firm  and  demonstrate  how 
modern  control  theory  techniques  are  applicable  to  the 
dynamic  optimization  problem  of  the  firm.  The  transition 
from  static  optimization  to  dynamic  optimization  theory 
is  accomplished  by  reviewing  the  discrete  time  minimum 
principle  and  applying  this  principle  to  the  classical 
problem  of  profit  maximization. 


10 


Chapter  III  introduces  the  concept  of  linear 
quadratic  control  and  develops  a  decentralized  model  of 
the  firm.  We  develop  the  concepts  of  decentralized  deci¬ 
sion  making  and  decentralized  information  availability  to 
formulate  a  well  posed  problem  of  decentralized  control. 

The  solution  of  the  decentralized  control  problem 
is  presented  in  Chapter  IV  where  we  derive  an  optimal 
decentralized  control  policy  for  a  general  organizational 
team  using  the  mathematical  approach  of  dynamic  program¬ 
ming  and  the  results  of  team  theory  developed  by  Radner 
(1962).  This  policy  is  then  applied  to  our  research 
problem  to  determine  the  effect  of  transfer  pricing  policy 
on  the  decentralized  decision  maker's  actions. 


CHAPTER  II 


CENTRALIZED  MODEL  OF  THE  FIRM 

Introduction 

As  we  have  emphasized  in  the  previous  chapter, 
the  purpose  of  this  dissertation  is  to  develop  an  ana¬ 
lytic  framework  which  will  enable  us  to  evaluate  the  per¬ 
formance  of  a  decentralized  firm  operating  in  a  dynamic 
environment  under  conditions  of  uncertainty.  This  chapter 
will  provide  the  analytic  background  necessary  to  under¬ 
stand  the  inherent  difficulties  encountered  when  we  extend 
the  classical  economic  model  of  a  firm  to  achieve  the  dis¬ 
sertation  objective.  A  second  purpose  of  the  chapter  is 
to  develop  an  orderly  transition  from  classical  static 
optimization  techniques  to  the  modern  control  theory 
approach  used  to  solve  optimal  control  problems. 

We  will  develop  a  static  model  of  the  firm  and 
solve  the  attendant  optimization  problem.  Next  we  will 
extend  the  model  to  a  dynamic  environment  and  develop  a 
maximization  principle  that  will  enable  us  to  solve  the 
dynamic  optimal  control  problem.  Following  chapters 
extend  the  model  to  incorporate  decentralized  information 
and  decision  making  and  address  future  uncertainties  which 
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the  firm  faces  and  we  develop  a  procedure  to  solve  the 
resultant  stochastic  control  problem. 

Static  Model  of  the  Firm 
Traditional  management  models  of  the  firm  nor¬ 
mally  include  an  organizational  structure  which  incorpo¬ 
rates  the  functions  of  production  and  marketing  with 
related  costs  of  distribution  and  production  and  related 
sales  revenue.  The  firm's  objective  is  normally  that  of 
profit  maximization.  The  model  we  develop  will  include 
these  concepts  where  profit  becomes  a  function  of  both 
sales  and  costs.  It  should  be  noted  that  certain  empiri¬ 
cal  evidence  exists  to  suggest  that  firms  may  not  regard 
profit  maximization  as  their  sole  overriding  objective.^ 
The  model  we  will  develop  does  not  explicitly  require  the 
assumption  of  profit  maximization  and  could  readily  be 
extended  to  incorporate  behavioral  preferences  through 
the  recognition  of  a  utility  function  which  incorporates 
individual  beliefs  and  preferences.  For  clarity  of 
exposition  our  model  will  retain  the  profit  maximization 
objective  which  implicitly  assumes  that  the  firm's  utility 
function  is  linear  in  dollars.  This  assumption  does  not 
cause  any  conceptual  difficulty  for  a  centralized  firm; 

3 

See,  for  example  Coase  (1972),  Cyert  and  March 
(1968) ,  Jensen  and  Meckling  (1976)  and  Salamon  and  Smith 
(1979) . 
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however,  it  does  become  restrictive  when  we  address  decen¬ 
tralized  decision  making  and  will  be  discussed  further  at 
that  point. 

Figure  2.1  shows  the  eseential  framework  of  the 
model  we  will  develop.  The  functions  of  production  (Divi¬ 
sion  P)  and  marketing  (Division  M)  are  aligned  under  the 
control  of  a  centralized  decision  maker  (Headquarters) . 

Raw  material  required  for  production  is  obtained  in  a 
perfectly  competitive  market  where  Division  P  can  pur¬ 
chase  any  amount  of  material,  qr,  at  the  prevailing  market 
price,  pr-  This  amount  is  processed  into  a  finished  pro¬ 
duct,  q^.  For  ease  of  exposition  we  assume  a  single 
input  production  process  with  no  internal  loss.  This 
notation  assumes  that  the  units  of  raw  material  and  the 
final  product  are  the  same.  This  assumption  could  be 


relaxed  by  defining  the  amount  of  raw  material  purchased 
as  aq^  where  a  represents  the  units  of  input  required  per 
production  unit.  These  assumptions  are  not  essential  to 
the  development  of  the  model;  however,  the  relaxation  of 


them  would  unduly  complicate  the  mathematics  involved 


without  adding  additional  clarity  of  exposition. 

The  marketing  division  receives  the  finished  good 
qp  and  distributes  an  amount  q^  at  a  price  p^.  Unlike 
the  raw  material  market,  the  demand  for  the  firm's  pro¬ 
duct  is  a  function  of  the  sales  price,  p^,  which  the  firm 
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Fig.  2.1.  Centralized  Model  of  the  Firm 


must  establish.  To  achieve  as  much  realism  as  possible 
we  will  assume  that  the  firm  can  mildly  affect  the  demand 
for  its  product  through  advertising,  product  differentia¬ 
tion  and  price.  Thus  its  market  is  neither  purely  com¬ 
petitive  nor  purely  monopolistic.  Chamberlin  (1962)  has 
termed  such  a  situation  as  monopolistic  competition  and 
the  interested  reader  is  referred  to  his  work  for  an 
extensive  treatment  of  the  subject. 

We  will  assume  that  the  firm  has  full  knowledge 
of  the  demand  function  for  its  product  which  exhibits  the 
relationship 


qf  =  -blPf  +  b2 

where  bpK)  and  b2^0  are  known  constants. 

The  firm's  revenue  is  a  function  of  sales  quan¬ 
tity  and  sales  price  as 


R  =  R(qf,pf) 


pfqf  • 


15 


The  firm's  cost  function  consists  of  production 
costs  and  marketing  costs.  We  assume  that  the  marketing 
division  incurs  a  fixed  cost,  Cw,  based  on  a  fixed  sales 
force  and  advertising  budget.  The  production  costs  are 
a  function  of  raw  material  costs  and  internal  processing 
(labor  and  machine)  costs.  In  economics  we  often  encoun¬ 
ter  a  production  cost-volume  relationship  such  as  that 
given  in  Figure  2.2. 


Cost 


Fig.  2.2.  Production  Cost  Function 

Such  a  function  has  the  property  that  marginal 
costs  decrease  initially  but  increase  above  a  certain 
volume.  Empirical  evidence  supports  this  relationship 
due  to  a  learning  effect  and  the  improvement  in  effi¬ 
ciency  that  accompanies  a  volume  increase  (Itami: 1973) . 
However,  we  note  that  the  majority  of  accounting  cost 
analyses  assume  a  linear  production  cost  relationship  of 
the  form  c  =  av  +  b  where  c  is  total  cost,  v  is  total 
volume  and  a,b  are  known  constants.  Although  this  func¬ 
tion  is  convenient  for  analytic  purposes,  we  feel  that 


L6 


marginal  cost  must  eventually  rise  as  volume  increases 
which  is  a  property  not  incorporated  using  a  linear  cost 
function.  To  incorporate  the  concept  of  increasing  mar¬ 
ginal  cost  we  will  assume  a  quadratic  cost  function  of 

2 

the  form  c  =  a  +  bv  +  cv  where  a,b,c  are  constants. 

This  will  enable  us  to  preserve  analytic  tractability  in 
our  model  and  at  the  same  time  achieve  an  implicit  objec¬ 
tive  of  the  analysis  which  is  to  capture  as  much  realism 
as  possible  without  unnecessarily  complicating  the  model. 
By  using  a  second-order  approximation  for  production 
costs  we  assume  that  the  firm  is  operating  somewhere  in 
region  II  (Figure  2.2).  Classical  microeconomic  theory 
tells  us  that  it  is  not  efficient  for  a  firm  to  operate 
in  regions  I  or  III  and  thus  we  feel  that  a  second-order 

estimate  of  production  cost  is  justified  from  both  an 

4 

empirical  and  theoretical  perspective.  The  quadratic 
cost  function  captures  the  concept  of  increasing  marginal 
cost  inherent  in  many  organizations;  however,  it  should 
be  noted  that  many  major  industrial  processes  may  not 
exhibit  quadratic  cost  behavior  and  the  application  of 
this  model  may  not  be  appropriate  for  firms  that  are  not 
operating  in  region  II. 

4 

For  a  further  discussion  of  quadratic  cost 
curves  and  their  fit  to  empirical  data,  see  Holt,  et  al. 
(1960)  and  Spencer  and  Siegelman  (1964). 
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The  production  cost  is  of  the  form 
Cp  =  al  +  Prqr  +  a2qr2 


where  a^,a2  are  known  constants  and  is  the  unit  cost 
of  the  raw  materials.  Thus  a^  represents  fixed  operating 
costs  and  a2  captures  the  concept  of  increasing  marginal 
costs  while  pr  captures  the  material  component  of  the  pro¬ 
duction  costs. 

The  total  profit  of  the  firm  becomes 
Profit  =  P  =  Revenue  -  Expenses 


P 

P 


pfqf  '  al  -  Prqr 


The  firm's  problem  becomes: 

2 

Maximize:  P  =  pfqf  ”  CM  ~  al  ”  prqr  ”  a2qr 
Subject  to:  q^  =  “b^Pf  +  b2» 


It  is  interesting  to  observe  at  this  point  in  the 
model's  development  that  perfect  information  is  assumed 
and  thus  the  issue  of  optimal  information  structure  is 
imbedded  in  the  model's  assumptions.  Similarly,  there  is 
no  transfer  pricing  problem  at  this  juncture  due  to  the 
centralized  aspect  of  the  model.  From  this  perspective 
the  model  is  currently  uninteresting  to  the  accountant 
concerned  with  the  development  of  information  systems  and 
optimal  transfer  pricing  policy.  Following  sections  will 
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extend  the  model  to  allow  us  access  to  the  issues  of 
information  structures  and  transfer  pricing  policy  along 
with  an  appreciation  of  the  difficulties  encountered  as 
we  extend  the  model.  This  extension  will  involve  a 
transition  from  classical  static  optimization  techniques 
to  optimal  control  theory  techniques.  To  facilitate  this 
transition,  the  next  section  will  review  classical  opti¬ 
mization.  We  will  then  extend  the  model  beyond  the 
static  case  and  develop  the  tools  of  optimal  control 
theory  which  will  enable  us  to  pursue  the  analysis  in 
the  remainder  of  our  research. 

Static  Optimization 

Under  the  specified  conditions  of  certainty  prob¬ 
lem  A  reduces  to  the  following,  which  we  designate  prob¬ 
lem  B: 

2 

Maximize:  Pfqr  -  <cM+ai)  “  Prqr  “  a2qr  (B"1) 
Subject  to:  qr  =  -b-^Pf  +  b2«  (B-2) 

Problem  B  can  be  solved  as  a  static  optimization 
problem  where  the  firm's  decision  involves  determining 
the  amount  of  product  produced  and  its  price.  Let  us 
redefine  this  amount  as  d,  where  d  represents  the  firm's 
production  decision  (note  that  d  =  qr  =  q^  =  q^) . 
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Substituting  (B2)  into  (Bl)  yields  the  profit 

function 

P  =  (  (b2~d)  /h\  )d  -  prd  -  a2d2  - 
then  the  first-order  condition  for  the  profit  function  is 
dP/dd  =  (b2~d)/b1  -  d/b.^  -  pr  -  2a2d 
denoting  d*  as  the  optimal  decision: 

d*  =  (l/(2{l+b1a2)))  (b2-blPr)  (2-1) 

and  the  optimal  sales  price: 

p*  =  (b2-d*)/b1  (2-2) 

Thus  the  firm  will  purchase  d*  units  of  raw 
material  at  price  pr  and  sell  d*  units  of  finished  pro¬ 
duct  at  a  price  p*.  It  is  apparent  that  these  results 
readily  reduce  to  the  classical  economic  result  that  the 
firm  should  price  its  product  such  that  marginal  revenue 
equals  marginal  cost. 

This  section  has  developed  the  results  needed 
to  solve  the  problem  we  formulated  as  problem  B.  However 
the  basic  model  we  developed  assumes  that  the  firm's  deci 
sion  problem  is  not  dynamic  in  that  future  changes  in 
price,  cost  and  the  market  demand  were  not  allowed.  In 
the  next  section  we  will  extend  the  model  to  allow  for  a 


dynamic,  changing  environment  and  begin  to  develop  the 
analytic  tools  required  to  evaluate  dynamic  optimization 
problems  using  the  techniques  of  optimal  control  theory. 


Dynamic  Model  of  the  Firm 

In  this  section  we  relax  the  demand  and  supply 
market  assumptions  inherent  in  problem  A.  The  model  will 
be  extended  to  allow  for  both  a  changing  environment  with 
respect  to  demand  for  the  firm's  product  and  also  a 
changing  price  in  the  raw  material  market.  As  a  result 
of  price  instability  in  the  world  over  the  last  ten 
years  and  no  indication  as  to  a  reversal  of  those  trends 
we  assume  that  the  price  of  raw  material,  pr,  will  tend 
to  rise  over  time.  If  we  consider  a  discrete  time 
period,  t,  we  can  represent  this  change  as 

Pr(t+1)  =  pr(t)  +  kpr ( t) 

where  the  argument  represents  the  time  period  of  interest. 
We  no  longer  consider  pr  to  be  constant,  but  rather  a 
variable  which  is  a  function  of  time  where  the  constant  k 
represents  the  rate  at  which  the  price  changes. 

Similarly,  the  firm  can  expect  to  be  exposed  to 
a  changing  demand  for  their  finished  product.  Recall  that 
the  actual  demand  q^  is  a  function  of  the  sales  price  and 
characteristic  parameters  b^  and  b2*  We  relax  the 
constancy  assumption  of  b2  and  allow  this  parameter  to 
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vary  over  time.  This  can  be  thought  of  as  a  change  in 
demand  due  to  a  change  in  market  composition  due  either 
to  brand  switching  or  a  general  increase/decrease  in  the 
number  of  market  participants.  Based  on  a  steadily  rising 
population  we  will  functionally  represent  this  demand  as 

qf(t)  =  -b1pf(t)  +  b2(t) 

where 

b2(t+l)  =  b2(t)  +  lb2(t) 

and  the  constant  1  represents  the  rate  at  which  b2  changes. 

In  a  dynamic  environment  we  are  concerned  with  the 
amount  of  product  the  firm  has  at  time,  t.  This  amount 
represents  the  difference  between  the  quantity  produced 
and  the  quantity  sold  which  we  represent  as 

h^t+l)  =  hT(t)  +  qr(t)  -  qf(t) 

where  hT  is  the  quantity  of  finished  goods  available  in 
any  period,  t.  This  relationship  allows  for  situations 
where  either  the  quantity  demanded  exceeds  the  quantity 
supplied  or  the  quantity  produced  exceeds  the  quantity 
demanded.  In  this  manner  the  model  is  extended  to  allow 
for  the  consideration  of  either  excess  inventory  or  back¬ 
log  situations.  We  assume  that  backlog  will  be  erased 
in  the  following  production  period  and  the  model 
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implicitly  penalizes  the  firm  in  such  a  situation  by 
incurring  the  attendant  rise  in  production  cost  required 
to  fill  the  backorder  units  which  are  sold  at  the  (pre¬ 
sumably)  lower  price  contracted  for  in  the  prior  period. 

In  addition,  we  penalize  the  firm  an  amount  m  per  squared 
unit  of  h^  which  can  be  interpreted  to  represent  the 
inventory  carrying  cost  or  the  potential  loss  of  cus¬ 
tomer  goodwill  due  to  the  firm' s  inability  to  satisfy  cur¬ 
rent  demand.  The  dynamic  model  of  the  firm  can  be  sum¬ 
marized  as: 

Demand  Relationship 

qf(t)  =  -blPf(t)  +  b2(t)  (2-3a) 

b2(t+l)  =  b2(t)  +  lb2(t)  ( 2-3b) 

Revenue 

R ( t)  =  pf(t)qf(t)  (2-4) 

Cost 

CM  +  Cp(t)  =CM  +al  +  pr(t)  qr(t)  +  a2qr2(t) 

(2-5a) 

pr(t+l)  =  pr(t)  +  kpr (t)  ( 2-5b) 

Profit 

P(t)  =  R ( t)  -  CM  -C  ( t )  -  mhT2(t)  (2-6) 

Quantity  Differential 

hrp  ( t-t- 1 )  =  lip(t)  -  qf(t)  +  qr  ( t) 


(2-7) 


The  firm's  problem  becomes 
Maximize : 
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P(t)  =  pf(t)qf(t)  -  CM  -  a1  -  pr(t)qr(t) 

-  a2qr2(t)  -  mhT2(t)  (C) 

Subject  to: 

qf(t)  =  -b^jCt)  +  b2(t) 

qr(fc)  =  qf(t>  -  hT(t) 

b2(t+l)  =  b2(t)  +  lb2(t) 

Pr<t+1)  =  pr(t)  +  kpr(t) 

hT(t+l)  =  hT(t)  -  qf(t)  +  qr(t) 

where  the  equation 

qr(t)  =  "  hT(fc) 

represents  a  constraint  on  the  production  decision  to 
recognize  current  inventory  assets  or  backlog  commit¬ 
ments. 

This  is  a  more  difficult  problem  to  solve  than 
the  static  model  posed  for  problem  B.  We  are  now  faced 
with  a  situation  where  the  firm  recognizes  that  it  must 
operate  in  a  constantly  changing  environment  and  wishes 
to  establish  an  optimal  decision  policy  over  some  future 
planning  period.  The  above  formulation  allows  for  future 
control  over  any  finite  number  of  periods.  For  exposi¬ 
tion  purposes  we  will  assume  the  firm's  planning  horizon 
extends  five  periods  into  the  future.  This  will  provide 
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enough  time  to  observe  dynamic  characteristics  of  the 
model  yet  not  force  the  solution  to  become  cumbersome. 

The  firm's  problem  can  be  restated  by  incorpo¬ 
rating  the  demand  relationship  into  the  model  as 

Maximize : 

-blPf2(t)  +  b2(t)Pf(t)  -  Pr(t)qr(t)  -  a2qr2(t) 
-  mhT2(t)  -  (C^a^  (C ' ) 

Subject  to: 

b2(t+l)  =  b2(t)  +  lb2(t) 

pr(t+l)  =  Pr(t)  +  kpr(t) 

hT(t+l)  =  hT(t)  +  bLpf(t)  -  b2(t)  +  qr(t) 

qr =  qf "  hT (t) - 

It  should  be  noted  that  the  model  still  does  not 
allow  us  to  address  the  issue  of  optimal  information 
structure  nor  does  it  explicitly  address  the  transfer 
pricing  problem.  On  the  other  hand,  we  can  now  discuss 
the  dynamic  decision-making  policy  a  firm  would  undertake 
if  we  could  solve  problem  C'.  In  the  next  section  we  dis¬ 
cuss  the  theory  of  optimal  control  which  will  provide  a 
means  to  solve  the  dynamic  problem  we  have  formulated. 
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Discrete-Time  Minimum  Principle 
The  theory  of  optimal  control  was  developed  pri¬ 
marily  over  the  last  two  decades.  This  development  was 
two-fold  in  that  it  began  in  this  country  through  the 
development  of  dynamic  programming  and  Bellman's  prin¬ 
ciple  of  optimality  (Bellman : 1957 ) .  Concurrently,  a 
parallel  development  in  the  Soviet  Union  using  a  differ¬ 
ent  theoretical  approach  was  carried  on  by  Pontryagin  and 
culminated  in  the  minimum  principle  v.hich  is  essentially 
an  extension  of  the  calculus  of  variations  approach 
(Pontryagin,  et  al.:1962). 

Essentially,  an  optimal  control  problem  consists 

of : 


1.  a  set  of  differential  or  difference  equa¬ 
tions  that  represent  a  system  that  is  to  be  controlled; 

2.  a  set  of  constraints  on  the  variables  of  the 

system; 

3.  a  set  of  boundary  conditions  on  the  vari¬ 
ables;  and 

4.  a  cost  functional,  or  performance  index, 
which  is  to  be  maximized/minimized. 

The  application  of  this  theory  to  our  problem  is 
straightforward.  The  system  is  represented  by  a  model  of 
the  firm,  a  set  of  difference  equations.  Our  model 
incorporates  explicit  constraints  on  the  variables  of  the 
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system  and  requires  boundary  conditions  on  the  initial 
values  of  the  variables.  Finally,  the  cost  function  is 
represented  by  the  decision  maker's  goals,  objectives  or 
utility.  The  objective  of  this  section  is  to  discuss  a 
solution  for  the  discrete-time  optimal  control  problem. 

The  state-space  approach  will  be  used  extensively  in  this 
and  remaining  sections  of  the  dissertation.  The  reader 
who  is  not  familiar  with  this  approach  should  read  Appen¬ 
dix  A  before  proceeding  and  is  also  referred  to  Ogata 
(1967)  for  an  extensive  treatment. 

Since  most  of  the  early  applications  of  control 
theory  to  engineering  problems  involved  continuous  time 
systems,  the  theoretical  foundations  for  optimal  control 
developed  most  extensively  in  the  continuous-time  form. 

The  minimum  principle  of  Pontryagin  which  provides  a  set 
of  necessary  conditions  for  the  solution  of  the  general 
continuous- time  optimal  control  problem  has  found  wide 
acceptance  and  application  to  engineering  problems.  One 
purpose  of  this  section  of  the  dissertation  is  to  discuss 
a  minimum  principle  for  discrete-time  optimal  control 
problems  that  will  be  general  enough  to  allow  application 
to  the  problems  that  will  interest  us. 

Pearson  and  Sridhar  (1966)  and  Rosen  (1967)  have 
shown  that  the  minimum  principle  could  be  approached  from 
the  point  of  view  of  Kuhn-Tucker  theory  and  that  a  dynamic 
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optimal  control  problem  could  be  expressed  and  treated  as 
a  larger  static  convex  programming  problem.  We  will  use 
their  approach  in  deriving  a  minimum  principle  for 
discrete- time  problems.  The  basic  problem  in  convex  pro¬ 
gramming  is  that  of  minimizing  the  scalar  function  J(y) 
subject  to  the  constraints  (y)  =  0  and  F2  (y)  21  0  where 
y  is  an  s-vector,  F-^(y)  is  an  n^  dimensional  vector  valued 
function  and  F^(y)  is  a  vector  valued  function  of  dimen¬ 
sion  n2>  In  addition,  the  assumptions  are  made  that  J (y) , 
F^(y)  and  F2  (y)  are  all  differentiable  in  their  arguments 
and  that  the  constraint  functions  are  convex  in  y. 

The  results  of  Kuhn-Tucker  theory  that  are  of 
interest  are  two  theorems  that  state  conditions  for  the 
solution  of  the  convex  programming  problem.  Define  the 
Lagrangian  as: 

L(y,p,u)  =  J(y)  +  pTF1  (y)  -  yTF2(y) 

where  p  and  y  are  n^  and  n2  vectors  of  Lagrange  multi¬ 
pliers.  Assume  that  y*  is  an  admissible  value  which 
satisfies  the  constraints  and  minimizes  J (y)  and  define 
the  following  vectors: 

L*  =  DL/ay^ j y* ,p* , y* ;  i  =  l,2,...,s 

L*  =  9L/3pi|y*;  i  =  1,2,...,^ 

L*  =  3L/9yi|y*;  i  =  1,2, ...,n2 
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L*  is  the  gradient  vector  of  the  Lagrangian  with 
respect  to  y  (i.e.,  each  of  the  s  components  of  y)  evalu¬ 
ated  at  y  =  y*,  p  =  p*,  and  y  =  y*;  i.e.,  at  the  value 

that  minimizes  J (y)  while  satisfying  the  constraints 
F-^y)  =  0  and  (y)  _>  0  and  the  corresponding  values  of 

p  and  y.  L*  and  L*  are  the  gradient  vectors  with  respect 

to  p  and  y,  evaluated  at  y  =  y*. 

For  convenience,  define  the  two  matrices: 

Fly = 3Fli/3yj ^ y*?  1  =  1,2'**,,nl?  ^  =  1'2'"‘*'s 
F2y = 3F2i/3yj I y*;  1  =  nl+1' * ' ' 'nl+n2? 


Thus  we  have 

L*  =  3 J/3y | y*  +  <F*y)T  p*  -  (F*y)Ty* 

L*  =  Fx(y*) 

L*  =  -F2(y*). 

The  two  Kuhn-Tucker  theorems  of  interest  are: 

Theorem  I : 

If  y*  minimizes  J (y)  subject  to  F^ (y)  =  0  and 
F2(y)  0,  then  it  is  necessary  that  there  exist 

some  p*  and  y*,  so  that  the  following  are  satis- 


f  ied : 
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L*  =  0 

y 

L*  =  0 
p 

L*  <  0 
M  - 

(L*)Tm*  =  0 

u*  >.  0. 

Theorem  II : 

If  y*  minimizes  J(y)  subject  to  F^ (y)  =  0  and 

F2(y)  ■>  0 ,  it  is  sufficient  that  Theorem  I  holds 

and 

L(y,p*,y*)  >  L(y*,P*,U*)  +  (L*)T(y-y*). 

(Note  that  Theorem  II  is  merely  a  convexity  con¬ 
dition  on  L. ) 

Theorem  I  gives  us  a  set  of  necessary  conditions 
for  the  solution  to  the  optimization  problem  that  may 
admit  several  extremal  solutions;  however,  Theorem  II 
states  that  if  the  optimization  problem  is  such  that  the 
Lagrangian  has  a  unique  minimum  with  respect  to  y,  then 
there  is  only  one  extremal  solution  and  Theorem  I  gives 
sufficient  conditions  for  an  optimum. 

Now  we  outline  the  optimal  control  problem  as 
composed  of  the  system 

x(t+l)  -  x(t)  =  f (x (t) ,u(t) ,t)  ;  t  =  0,1,.  .  .  ,N 


where  x(t)  is  now  an  n-vector  and  u(t)  is  an  r-vector. 
The  system  is  subject  to  the  initial  conditions 
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x ( t=0 )  =  x (0 ) 

and  the  final  condition 

t  =  N. 

We  want  to  minimize  the  cost  functional 

N-l 

J  =  K (x (N)  )  +  £  L(x(t)  ,u(t)  ,t) 

t=  0 

and  we  require  that  the  sequence  {x(t),u(t)}  belongs  to 
the  constraint  set  respresented  as 

P  (x  (t)  ,u  (t)  ,  t)  _>  0 

where  p  is  a  vector-valued  function  of  dimension  m. 

To  apply  the  Kuhn-Tuckei  theorems,  the  optimal 
control  problem  must  be  restated  as  a  convex  programming 
problem.  To  do  this,  define  the  (n+r)N  =  sN  vector  y  as 

y  =  [x  (1)  ,  .  .  .  ,x  (N)^u(O)  , .  .  .  ,u  (N-l)  ]T. 

Next  define  the  nN  vector  (y)  as 


h 
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f (x ( 0 )  , u (0)  ,0)  -  x(l)  +  x  (0 ) 
f  (x(l)  ,u(l)  ,1)  -  x(2)  +  x(l) 

F1(y)  = 

f  (x(N-l)  ,  (N-l)  ,N-1)  -  x  (N )  +  x(N-l) 

and  the  mN  vector  F2(y)  as 

p  (x  (0),  u  ( 0)  ,0) 

f2(y)  = 

p  (x  (N-l)  ,  u  (N-l)  ,N-1) 

Thus,  the  optimal  control  problem  is  equivalent 
to  minimizing: 

N-l 

J(y)  =  K(x(N))  +  Z  L  (x  (t )  ,  u  ( t)  ,  t) 

t=0 

subject  to: 

Fx(y)  =  0 
F2(y)  >  0. 

We  can  now  apply  the  Kuhn-Tucker  theorems  stated 
earlier  where  we  define  the  Lagrangian  as 

L(y»p#y)  =  J (y)  +  pT  Fx(y)  -  pT  F2(y) 

However,  p  is  now  an  nN~ vector  and  u  is  an  mN- 


vector  of  Lagrange  multipliers.  Application  of  the 


Let  (x*(t)l  be  the  trajectory  of  the  dynamic 


system  of  interest  corresponding  to  the  control  sequence 

{ u* (t) }  with  x*(t=0)  =  x{0)  and  { x* (t) ,u* (t) }  constrained 

to  the  set  of  p  (x  (t)  ,u  (t)  , t)  >_  0.  Then  if  {u*(t)}  mini- 

N-l 

mizes  the  cost  functional  J  =  K(x(N))  +  I  L(x(t), 

t=0 

u(t),t),  it  is  necessary  that  there  exists  a  sequence 
of  n  vectors  {p* (t) ;  t  =  0,1,..., N}  called  the  co-states, 
and  a  sequence  of  m  vectors  {p*(t);  t  =  0,1,..., N}  called 
the  co-constraint  vectors  such  that: 

1.  The  scalar  function 

H(x*(t)  ,p*(t+l) ,u(t)  ,  y*  (t+1) ) 

=  L (x* (t) , u ( t ) ,t) 

+  (p* (t+1) )Tf (X* (t) ,u(t) ,t) 

-  (M*  (t+1) )Tp(x* (t) ,u(t) ,t)  (2-8) 

called  the  Hamiltonian  is  minimized  as  a  function  of  u(t) 
at  u(t)  =  u*(t)  for  all  t  =  0,1,...,N-1. 

5 

The  interested  reader  is  referred  to  Pindyck 
(1973)  for  the  algebraic  details. 
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2.  The  dynamics  of  x*  (t)  ,  p* (t)  ,  andy*(t)  are 
determined  by  the  equations 

x*(t+l)  -  x*  (t)  =  3h/  3p(t+l)|*  =  f  (x*  (t)  ,u*  (t)  ,t) 


(2-9) 

p*(t+l)  -  p*(t)  =  -3H/3x(t)|*  (2-10) 

p(x*(t)  ,u*(t)  ,t)  =  -3H/3y(t+l)  |*  >.  0  (2-11) 

y*  (t)  >_  0  (2-12) 

pT(x* (t)  ,u* (t)  ,t)y*  (t+1)  =0  (2-13) 

p* (N)  =  3K(x*(m) )/3x.  (2-14) 

Model  Revisited 


For  case  of  reference  we  restate  the  firm's  prob¬ 
lem  as  defined  by  equations  C': 

Maximize : 

-b1pf2(t)  +  b2(t)pf(t)  -  pr(t)qr(t)  -  a2qr2(t) 
-  mhT2(t)  -  (CM+a]L)  (C) 

Subject  to: 

b2(t+l)  =  b2(t)  +  lb2(t) 
pr(t+l)  =  pr(t)  +  kpr (t ) 
hT(t+l)  =  hT(t)  -  b2(t)  +  blPf(t)  +  qr(t) 

qr(t)  =  qf (t)  "  hT(t) * 
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Recall  that  this  model  assumes  the  availability 
of  perfect  information  on  which  the  centralized  form  will 
base  its  production  decision  and  product  line  pricing 
policy.  For  mathematical  convenience  we  will  restate 
problem  C'  in  state-space  representation  which  will  enable 
us  to  formulate  the  firm's  problem  as  an  optimal  control 
pronlem.  Let  us  define  the  three-dimensional  state  vec¬ 
tor 


b2  (t) 

xx  (t) 

demand  parameter 

X  (t)  = 

Pr  (t) 

= 

x2  (t) 

= 

raw  material  price 

hT(t) 

*3(t)_ 

inventory /backlog 

Similarly,  we  define  the  two-dimensional  decision/ 
policy/control  vector  as 

production  decision 
pricing  decision 

We  can  now  generate  the  state-space  representa¬ 
tion  of  the  model  of  the  firm  as 

x ( t +1 )  -  X ( t )  =  f (x (t) ,U(t) , t) 


qr  (t) 

ux  (t) 

Pf  (t) 

u2 

where  the  (3x1)  vector  valued  function  f  is  defined  as 
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f (x (t)  ,u  (t)  ,t) 


lx1(t) 

k*2 (t ) 

-x1(t)  +  b1u2(t)  +  u1 (t) 


We  note  that  the  model  of  the  firm  can  be 
expressed  in  linear  form  as 


x(t+l)  -  x(t)  =  Ax(t)  +  Bu(t) 


(2-15) 


where  the  (3x3)  matrix  A  and  the  (3x2)  matrix  B  are 
defined  as 


f— 1 

1 

0 

0 

0  0 

A  = 

0 

k 

0 

;  B  = 

0  0 

-1 

0 

0 

!  bi 

The  dynamic  model  of  the  firm  we  have  developed 
is  completely  described  by  the  vector-matrix  difference 
equation  (2-15) .  The  cost  function  was  deixned  earlier 
as 

N-l 

J  =  K  (x  (N)  )  +  E  L(x(t)  ,u(t)  ,t)  . 
t=0 


We  can  convert  the  firm's  objective  of  profit 
maximization  to  the  analogous  cost  minimization  problem 
by  defining 


T 


N-l 

J  =  -  (1/2) K (x (N) )  -  (1/2)  Z  P(t) 

t=0 


where  P(t)  is  defined  by  equation  (2-6)  and  K  defines  the 
final  state  of  the  model  which  yields 


J  =  (1/2)  1  <CM  +  Cp ( t)  +  mh_  (t)  -  R (t) ) 

t=0  1 


+  (l/2)mhT^(N) 


which  can  be  written  explicitly  as 


J  =  (1/2)  Z  (blPfz(t)  -  b2(t)pf(t)  +  pr(t)qr(t) 
t— 0 


+  a2qr2(t)  +  mhT2(t)  +  (CM~i-a1)  ) 


+  (1/2)  mhT‘(N) . 


Converting  to  state-space  representation  yields 


(1/2)  Z  (t)  -  xx(t)u2(t)  +  x2(t)u1(t) 

+  a2ui2(t)  +  mx32(t)  +  (CM+a^) ) 

+  (l/2)mx32(N) 


which  can  be  written  in  matrix  form  as 


I  I  ■¥  V  I*  • 


3' 


J 


(1/2)  Z  ( x  ( t )  Qx  ( t )  +u  (t)Ru(t) 

t=0 

+  xT(t)Su(t)  -4-  C )  +  (l/2)xT  (N)Qx(N) 


where  x(t)  and  u(t)  are  the  state  and  control  vectors  pre- 
variously  defined  and  C  =  constant  scalar  (CM+a^)  with 


Q 


0  0  0 

0  0  0 

0  0m 


a 


2 


R  = 


0 


S 


0 

1 

0 


-1 

0 

0 


We  now  restate  the  firm's  problem  as  an  optimal 
control  problem  (problem  D)  in  which  we  want  to  select 
a  control  sequence  (u*(t)}  such  that  the  function 


N-1  m  m 

J  =  (1/2)  Z  (xA (t) Qx (t)  +  uA(t)Ru(t) 
t=0 


+  xT(t)Su(t)  +  C)  +  (l/2)xT  (N)Qx(N)  (D) 


is  minimized  and  the  system 


x  ( t+1 )  -  x(t)  =  Ax  (t)  +  Bu  (t) 
is  subject  to  the  initial  condition 
x  (t=0)  =  x (0) 

and  the  final  condition 

t  =  N 

and  the  quantity  constraint 

qr(t)  =  qf (t)  "  hT<t)  * 

We  can  apply  the  minimum  principle  we  developed 
in  the  previous  section  to  determine  an  optimal  planning 
policy  for  the  multi-period  model. 

Dynamic  Optimization 

Application  of  the  discrete-time  minimum  prin¬ 
ciple  to  our  multi-period  model  yields  the  following 
optimal  planning  policy:** 

'  u* (t)  =  J  x* (t)  (2-16a) 

where 

J  =  (1/2) (DR'1DT) _1  R~1DTDR"1ST- (1/2)R"1ST 
-(DR“1DT)"1R~1DTC. 

6See  Appendix  B  for  derivation. 


(2-16b) 
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This  relationship  shows  that  the  optimal  planning 
decisions  are  a  linear  function  of  the  current  state  of 
the  system.  For  our  model  J  becomes 


J  =  l/(2(l+a2b1) ) 


1  -bL  -2 

(l+2a2b1)/b1  1  -2a2 


thus  the  optimal  planning  policy  is  determined  by 


1  -b,  -2  1 

u*  (t)  =  1/  (2  (1+a-b, )  ) 

1 

x*  (t) 

4 

(l+2a2b1)b1  1  -2a2 

(2-17) 

If  we  assume  that  the  initial  inventory /backlog 
is  zero  and  ignore  the  dynamics  of  our  model,  we  note 
that  the  optimal  planning  policy  becomes 


u*  =  l/(2(l+a2b1)) 


-bl  -2 


(l+2a2b1)/b1  1 


-2a. 


u*  =  l/(2(l+a2b1)) 


b2  -  blPr 

qr 

(b2  (l+2a2b1) ) /bj^  +  pr 

_pf_ 

(2-18) 


Expanding  equation  (2-18)  we  observe  that  the 
optimal  production  decision  is 


qr  “  2(l+a2b1)  (b2_blpr) 


(2-18a) 


This  result  is  identical  to  the  classical  result  obtained 
in  equation  (2-1) .  We  note  that  the  optimal  production 
decision  is  linearly  related  to  the  demand  parameter  b2 
in  a  positive  manner  such  that  an  increase  in  b2  (which 
implies  a  change  in  demand  schedule)  results  in  an 
increase  in  the  production  decision.  This  result  is 
intuitively  appealing  since  we  would  expect  an  increase 
in  the  quantity  demanded  to  result  in  an  increase  in  the 
production  decision.  The  second  demand  parameter  b^ 
(recall  that  we  have  posited  a  linear  demand  function  of 
the  form  q^.  =  -b^p^+b2)  appears  in  both  the  numerator  and 
denomerator  of  the  optimal  production  decision.  We  note 
that  a  perfectly  competitive  external  market  requires  that 
b-j.  =  0.  In  this  limiting  case,  the  optimal  production 
decision  becomes  b2/2  which  represents  the  upper  limit  for 
optimal  production,  since  an  increase  in  b^  reduces  the 
magnitude  of  the  numerator  and  increases  the  magnitude  of 
the  denominator.  Thus,  as  b^  increases  (which  implies 
that  customer  demand  is  becoming  more  sensitive  to  pricing 
considerations)  the  production  decision  becomes  more  con¬ 
servative.  This  result  captures  a  conservative  aspect  of 
the  model  that  adjusts  the  production  decision  downward 
(which  results  in  a  hedge  against  the  risk  of  losses  due 
to  overproduction)  as  the  demand  for  the  product  becomes 
more  volatile. 


Further  analysis  of  the  optimal  production  deci¬ 
sion  reveals  the  expected  results  with  respect  to  the 
production  cost  components.  Increases  in  the  raw 
material  cost,  p^,  or  the  internal  variable  cost  param¬ 
eter,  a2,  both  result  in  reductions  in  production. 

Further  expansion  of  equation  (2-18)  reveals  the 
optimal  pricing  decision  as 

Pf  =  <b2-q£)  *  (2-18b) 

This  result  is  identical  to  the  classical  result  of  equa¬ 
tion  (2-2)  .  We  observe  that  demand  parameters  b2  and  b^ 
appear  in  the  numerator  and  denominator,  respectively. 
Thus  we  observe  the  same  affect  on  the  optimal  pricing 
decision  as  was  seen  for  the  optimal  production  decision. 
That  is,  an  increase  in  the  demand  schedule  results  in  an 
increase  in  the  product  price  whereas  a  consumer  market 
that  becomes  more  volatile  with  respect  to  pricing  con¬ 
siderations,  results  in  a  reduction  of  the  price  of  the 
product.  These  results  are  intuitively  appealing  in  that 
we  tend  to  observe  large  firms  acting  in  the  manner  dis¬ 
cussed  here.  We  also  note  that  the  dynamic,  multi-period 
results  can  be  readily  reduced  to  the  classical  economic 
results  that  the  firm  should  price  its  product  such  that 
marginal  revenue  equals  marginal  cost.  The  multi-period 
analysis  under  conditions  of  certainty  can  be  iteratively 
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solved  as  a  static  problem  in  each  period  using  the 
results  of  equation  (2-18).  In  fact,  if  our  purpose  was 
to  design  a  planning  model  under  conditions  of  certainty, 
we  could  have  done  so  without  the  use  of  an  optimal  con¬ 
trol  theory  approach.  However,  we  are  interested  in 
developing  a  control  technique  which  will  result  in 
encouraging  decision  makers  to  act  in  such  a  manner  as 
to  implement  decisions  that  are  in  conformance  with  some 
overall  corporate  plan.  For  illustrative  purposes. 


assume  that  the 

firm  has 

decided 

on  the 

following 

f  ive- 

year  corporate 

plan: 

Year  1 

•  Year  2 

Year  3 

Year  4 

Year  5 

Production  Level 

1448 

1541 

1639 

1743 

1853 

Product  Price 

$35520 

$37090 

$38733 

$40452 

$42250 

The  above  plan  was  actually  generated  using  the 
planning  model  developed  in  this  chapter,  conditioned  on 
a  current  raw  material  price  of  $20750,  demand  parameters 
b^  =  0.1  and  b£ (current)  ~  5000,  growth  rates  for  1  and  k 
of  5  percent  and  3  percent  respectively  and  a  penalty, 

7 

m  =  $10000.  The  actual  planning  process  that  management 
uses  to  arrive  at  the  desired  targets  is  not  crucial  to 
this  discussion.  The  essential  problem  we  are  interested 
in  is,  "Given  the  planning  objectives,  can  we  derive  a 

^See  Appendix  C  for  computation  of  plan. 


control  technique  which  will  result  in  decentralized 
decisions  that  result  in  conformance  to  the  overall 
corporate  plan?" 

In  this  chapter  we  have  developed  a  model  for 
centralized  decision  making.  In  the  next  chapter  we 
extend  the  model  to  incorporate  decentralized  decision 
making  and  develop  a  control  technique  that  will  encourage 
decentralized  decision  makers  to  implement  actions  that 
are  not  only  in  their  best  interests,  but  also  in  the  best 
interest  of  the  firm  overall. 


CHAPTER  III 


DECENTRALIZED  CONTROL  MODEL 

Introduction 

In  Chapter  II  we  developed  an  optimal  planning 
model  for  a  firm  that  operates  in  a  deterministic  cen¬ 
tralized  decision-making  environment.  In  this  chapter  we 
will  develop  a  control  technique  that  will  encourage 
decision  makers  to  achieve  the  planning  objectives  which 
the  firm  has  established.  This  control  technique  will 
then  be  applied  to  a  decentralized  extension  of  the  model 
developed  in  the  last  chapter.  The  control  technique  will 
be  evaluated  with  respect  to  its  ability  to  encourage 
decentralized  decision  makers  to  act  in  a  manner  that  is 
in  the  best  interest  of  the  firm  as  a  whole. 

Control  Technique 

In  this  section  we  discuss  a  control  technique 
that  has  been  used  in  the  design  of  physical  systems  as  a 
result  of  the  application  of  optimal  control  theory  to  a 
linear  dynamic  system  with  a  quadratic  performance  index 
used  as  the  instrument  to  measure  the  desired  performance 
of  the  system.  The  control  technique  is  also  applicable 
to  stochastic  systems  (as  we  shall  discuss  later)  and  is 
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commonly  referred  to  as  the  Linear-Quadratic-Gaussian 

O 

(LQG)  problem.  A  discrete  version  of  the  control  philos¬ 
ophy  inherent  in  the  LQG  problem  with  emphasis  on  eco¬ 
nomic  system  analysis  was  presented  in  1972  (Athans : 1972) . 
Since  that  time  economists  have  applied  the  control 
approach  to  numerous  problems  concerned  with  the  stabili- 

9 

zation  and  control  of  economic  systems.  The  basic  con¬ 
trol  mechanism  is  a  performance  index  intended  to  encour¬ 
age  conformance  with  some  predetermined  nominal  plan.'*'® 
Kornai  and  Simonovits  (1977)  have  addressed  this  control 
philosophy  by  defining  a  real  sphere  and  a  control  sphere 
where  the  real  sphere  consists  of  the  dynamic  model  and 
its  desired  objectives  and  the  control  sphere  involves  a 
penalty  function  used  to  measure  the  performance  of  the 
system.  The  essential  idea  of  the  control  technique  is 
to  define  a  penalty  function,  quadratic  in  form,  that  will 
punish  deviations  from  a  desired  plan.  The  quadratic  form 
results  in  small  penalties  for  small  deviations  and 

Q 

For  a  comprehensive  survey  of  engineering  appli¬ 
cations,  see  the  Special  Issue  of  IEEE  Transactions  on 
Automatic  Control  (Athans : 1971) . 

9 

For  a  survey  of  economic  applications,  see 
Kendrick  (1976) . 

10As  a  historical  point,  it  is  interesting  to  note 
that  the  basic  concept  of  LQG  control  was  initially  intro¬ 
duced  in  an  industrial  setting  by  Holt,  et  al.  in  1960. 
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increasingly  higher  penalties  for  more  significant 
deviations.  In  the  next  section  we  will  develop  a  con¬ 
trol  model  for  decision-making  using  the  LQG  approach 
(with  the  exception  that  our  model  does  not  yet  explicitly 
incorporate  uncertainty  considerations) . 

Control  Model 

We  expressed  the  dynamic  model  of  the  central¬ 
ized  firm  as: 

b2(t+l)  -  b2(t)  =  1  b2(t) 

pr(t+l)  -  pr(t)  =  k  Pr(t) 

hT(t+l)  -  h^  (t )  =  -b2(t)  +  b2pf(t)  +  qr(t) 
subject  to  the  constraint 

qr(t)  =  qf(t)  "  hT(t* 

and  developed  an  optimal  control  problem  in  which  we 
wanted  to  select  a  decision  policy,  {u*(t)>  ,  such  that 
the  function 

N-l 

J  =  (1/2)  I  (x'(t)Qx(t)  +  u'(t)  Ru (t ) 
t=0 

+  x ' (t) Su (t)  +  C)  +  (l/2)x' (N)Qx(N) 

was  minimized  subject  to  the  dynamic  model  of  the  firm 
and  the  static  constraint. 
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Application  of  the  discrete  minimum  principle 
resulted  in  an  optimal  policy  for  the  firm  (Eqn.  2-18) . 
This  policy  was  used  to  generate  sample  corporate  targets 
for  production  levels  and  product  pricing.  We  extend  our 
model  to  introduce  the  concept  of  separation  of  ownership 
and  managerial  decision  making  by  defining  a  predetermined 
plan  specified  by  the  owner  of  the  firm.  We  assume  that 
the  (centralized)  decision  maker  attempts  to  achieve  the 
owner's  objectives  due  to  an  agreed  upon  incentive 
arrangement  based  on  penalizing  the  decision  maker  for 
deviations  from  the  owner's  plan.  This  concept  can  be 
formulated  as  a  control  problem  by  defining  a  penalty 
function  of  the  form: 

N-l 

J  =  (1/2)  Z  (  (x(t)-x(t)  )*C>  (x(t)-x(t)  ) 
c  t=0  c 

+  (u(t)-u(t)  ) 'Rc(u(t)-u(t))) 

+  1/2 (x (N) -x (N) )  Qc (x (N) -x (N) )  (3-1) 

where  the  planning  objectives,  x(t)  and  u(t),  are  incorpo¬ 
rated  into  the  penalty  function  and  the  matrices,  Qc  and 
Rc,  are  used  to  "weight"  the  relative  importance  of  both 
state  deviations  and  control/decision  deviations  (note 
that  these  are  not  the  same  Q  and  R  matrices  defined 
earlier) .  This  penalty  function  must  be  minimized 
subject  to  the  actual  dynamic  model  of  the  firm  developed 
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earlier ;  i . e .  , 

x  ( t+ 1 )  -  x ( t )  =  A  x ( t )  +  B  u(t)  (E) 

where 


The  solution  to  this  problem  can  be  obtained 
using  the  discrete  minimum  principle  in  the  same  manner 
as  was  done  in  Appendix  B.  The  resulting  optimal  control 
policy  for  the  decision  maker  to  follow  is 

U* (t)  =  -(R  +B'K(t+l)B)“1B'K(t+l) ( (I+A)x*(t) 

+  Bu ( t) )  +  u(t)  (3-2) 

K (t)  =  Qc  +  (I+A) ' (K(t+1)-K(t+1) 

B(R  +B,K(t+l)~1B'K(t+l)) (I+A)  (3-3) 


The  selection  of  the  weighting  matrices  in  the 
quadratic  criterion  is  not  a  simple  matter.  These 
matrices  provide  a  mechanism  to  operationalize  the  tech¬ 
nique  by  which  the  owner  provides  an  incentive  arrangement 
based  on  his  preferences.  It  should  be  noted  that  in 
this  control  model,  the  goals  x(t)  and  u(t)  need  not  be 
dependent  upon  each  other,  nor  do  they  have  to  be 
generated  using  the  dynamic  system  (E) .  Furthermore,  the 
static  constraint  was  not  considered  in  the  development 
of  the  control  model.  We  will  have  more  to  say  concern¬ 
ing  the  incentive  arrangement  (weighting  matrices)  when 
we  extend  our  model  to  a  decentralized  environment.  In 
most  practical  applications  we  will  select  Q  and  R  to 
be  diagonal.  In  this  manner  specific  components  of  the 
state  deviations  and  control  deviations  can  be  weighted 
individually  and  their  impact  can  be  assessed  quantita¬ 
tively. 

As  a  check  on  our  model  we  use  the  plan  developed 
in  Chapter  II  and  (somewhat  arbitrarily)  set 
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This  incentive  arrangement  weights  the  deviations  from 
quantity  differential,  price  and  production  quantity 
equally  and  assigns  a  zero  weight  to  the  demand  param¬ 
eter  and  the  input  product  price.  Since  the  firm  has  no 
control  over  these  last  two  variables,  we  would  not 
expect  to  see  any  incentive  arrangement  with  respect  to 
the  raw  material  price  or  the  demand  parameter,  b2 (t)  . 

Using  Eqns.  (3-2)  through  (3-4)  we  find  that 

u*(t)  *  u(t).  (3-5) 

Thus  the  control  model  we  developed  results  in 
actions  by  the  decision  maker  which  follow  the  owner's 
plan  exactly.  Although  this  simple  example  verifies  the 
correctness  of  the  control  model  it  does  not  approximate 
reality  in  that  the  owner  seldom  specifies  exactly  the 
quantities  of  production  and  the  sales  price  (if  this 
could  be  done  the  role  of  the  decision  maker  would  no 
longer  be  required) .  Instead,  we  observe  targets  in  the 
form  of  cost  performance  (budgets)  and  revenue  performance 
(sales  quotas) .  Thus  the  owner  can  establish  targets 
for  cost  and  revenue  without  the  detailed  knowledge  of 
internal  performance  parameters.  The  decision  maker  would 
then  internalize  these  goals  by  defining  explicit  objec¬ 
tives  internal  to  the  firm.  For  example,  if  the  owner 
establishes  a  revenue  target  of  r(t) ,  the  decision  maker, 


having  full  knowledge  of  the  firm's  operations,  will 
internalize  the  goal  of  pf(t)qf(t)  =  r(t)  and  thereby 
establish  an  internal  target  pf (t) .  In  this  way  the  con¬ 
trol  model  can  be  generalized  to  allow  the  owner  to  estab 
lish  incentive  mechanisms  for  targets  which  he  desires 
the  decision  maker  to  achieve  and  the  decision  maker 
(through  the  model  dynamics  and  knowledge  of  the  inter¬ 
relationships  internal  to  the  firm)  will  internalize 
those  targets  by  defining  explicit  internal  objectives. 

Decentralized  Model  of  the  Firm 
Figure  3.1  extends  the  earlier  framework  (Figure 
2 r 1)  to  address  a  decentralized  organization. 


Fig.  3.1.  Decentralized  Model  of  the  Firm 

This  framework  incorporates  two  significant  extensions  of 
the  earlier  model.  First,  an  amount  qM  is  transferred 
from  Division  P  to  Division  M  for  which  Division  M  pays 
Division  P  an  amount  pT,  the  transfer  price.  Second, 
Division  P  has  the  decentralized  authority  to  make  the 
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production  decision,  qr,  and  the  transfer  pricing  deci¬ 
sion,  pT«  The  manager  of  Division  M,  in  turn,  has  the 
decentralized  authority  to  decide  on  the  amount  of  the 
transferred  good,  qM,  he  will  purchase  and  the  external 
product  pricing  decision,  pf.  Recent  literature  on  trans¬ 
fer  pricing  (see  Chapter  I)  has  essentially  advocated 
some  form  of  negotiation  scheme  involving  an  iterative 
process  which  may,  or  may  not,  involve  corporate  head¬ 
quarters.  However,  these  analyses  have  primarily  dealt 
with  a  deterministic,  static  environment.  The  dynamic, 
multi-period  analysis  we  are  investigating  will  tend  to 
minimize  any  "gaming"  strategies  during  a  negotiation 
process  since  there  is  ample  time  for  future  "settling 
up."  Since  any  iterative  process  is,  by  definition,  time- 
consuming  we  wish  to  investigate  the  possibility  of  estab¬ 
lishing  some  incentive  scheme  that  will  encourage  the 
decentralized  decision  makers  to  arrive  at  decisions  that 
are  in  the  best  interest  of  the  firm  overall  without 
requiring  a  time-consuming  process.  A  significant  premise 
of  this  dissertation  is  that  the  decentralized  decision 
makers  can  be  provided  incentives  such  that  it  is  in  their 
best  interest  to  work  as  a  team  to  achieve  the  overall 
corporate  goals.  The  team  approach  was  introducted  by 
Marschak  and  Radner  (1972)  .  The  team  concept  essentially 
suggests  that  functional  behavior  in  a  large  decentralized 
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organization  dominates  any  dysfunctional  behavior,  such 
that  inherent  game-theoretic  organizational  traits  can  be 
disregarded  for  analytic  purposes. 

Our  decentralized  model  also  assumes  that  the 
intermediate  market,  i.e.,  the  internal  transfer  of  goods, 
is  restricted  to  one  buyer  (Division  M)  and  one  seller 
(Division  P)  and  can  thus  be  considered  as  a  bilateral 
monopoly.  The  familiar  Edgeworth  Box  analysis  of  this 
situation  dictates  that  a  mutually  satisfactory  solution 
will  lie  along  a  contract  curve  reached  as  a  result  of 
mutual  benefit  to  both  the  buyer  and  the  seller.  Although 
this  form  of  equilibrium  (referred  to  as  Pareto  optimal¬ 
ity)  can  be  achieved  in  principle,  it  is  not  apparent  that 
a  Pareto  optimal  solution  would  be  in  the  best  interests 
of  the  firm  as  a  whole.  It  is  well  known  that  the  classi¬ 
cal  economic  solution  for  this  situation  is  indeterminate 
without  the  addition  of  negotiation  or  an  incentive  scheme. 
Dopuch  and  Drake  (1964)  have  shown  that  other  market 
situations  readily  lead  to  optimal  solutions  which  dic¬ 
tate  the  use  of  the  prevailing  market  price  if  the  inter¬ 
mediate  market  is  perfectly  competitive  or  the  use  of 
Hirshleifer * s  procedure  (Hirschleifer ; 1956 )  if  the  compe¬ 
tition  is  imperfect. 

Our  decentralized  model  generates  a  dynamic  system 
in  which  several  decision  maker's  actions  will  jointly 
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affect  the  dynamic  behavior  of  the  system.  The  decision 
makers  will  base  their  actions  on  partial  and  (in  the 
sequel)  imperfect  information  on  the  various  states  of  the 
dynamic  system  on  each  other's  actions.  In  a  decentral¬ 
ized  environment  we  envision  the  production  manager  to 
have  access  to  current  information  that  is  not  available 
to  the  marketing  manager,  internal  production  cost  infor¬ 
mation,  for  example.  Similarly,  we  would  expect  that  the 
marketing  manager  has  access  to  current  market  information 
not  available  to  the  production  manager.  Thus,  the  infor¬ 
mation  necessary  to  make  "optimal"  decisions  is  decen¬ 
tralized  and  is  not  available  in  any  one  place.  This 
situation  represents  a  radical  departure  from  Walrasian 
systems  in  which  all  the  necessary  information  is  assumed 
to  be  available  to  the  auctioneer  or  to  the  central  agency 
(headquarters) .  Since  all  of  the  needed  information  is 
not  available  in  any  one  place,  the  control  of  a  decen¬ 
tralized  organization  is  more  difficult  than  for  a  cen¬ 
tralized  organization.  A  decentralized  information  pat¬ 
tern  implies  certain  structural  restrictions  on  control 
policies.  This  lack  of  centralized  information  requires 
a  degree  of  cooperation  among  decision  makers  so  that 
their  actions  can  be  coordinated  to  work  together  to  con¬ 
trol  the  decentralized  dynamic  system.  Thus  the  problem 
of  controlling  a  decentralized  organization  involves  team 


55 


decision  making  which  is  a  special  case  of  the  theory  of 
teams  (Aoki:1976). 

The  dynamic  model  of  the  decentralized  firm  can 
be  expressed  as 

Pr(t+1)  =  (l+k)pr(t) 

b2(t+l)  =  (l+l)b2(t)  (F) 

hTM(t  +  1)  =  hTM(t)  +  qM(t)  '  qf(t) 

=  hTM<t)  +  <3M<t)  +  biPf<fc)  "  b2(t) 
hTp(t+l)  =  hTp(t)  +  qr(t)  -  qM(t) 

where  hTM<t)  and  hTp(t)  represent  the  difference  between 
the  quantity  of  goods  "produced"  and  the  quantity  sold 
for  the  marketing  and  production  divisions,  respectively. 
The  dynamics  of  the  decentralized  model  become  more  com¬ 
plex  than  that  of  a  centralized  firm  due  to  the  addition 
of  an  additional  dynamic  equation  (due  to  quantity  differ¬ 
entials)  .  Furthermore,  decisions  are  now  made  by  differ¬ 
ent  decision  makers;  i.e..  Division  P  has  control  of  qr(t) 
and  pT(t)  whereas  Division  M  has  control  of  decisions 
qM(t)  and  pf (t) . 

As  discussed  earlier,  this  model  does  not  have  a 
determinate  optimal  economic  solution.  To  determine  the 
explicit  impact  of  transfer  pricing  policy  on  the  decen¬ 
tralized  decision  maker's  actions,  we  modify  the  model  to 
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enable  us  to  evaluate  the  implications  of  transfer  pricing 
policy  by  describing  the  transfer  price,  p^,  as  an  addi¬ 
tional  state  variable  as  opposed  to  a  decentralized  deci¬ 
sion  variable.  This  modification  results  in  the  follow¬ 
ing  decentralized  decision-making  model: 

Pr(t+1)  -  pr(t)  =  kpr(t) 

PT(t+l)  -  pT(t)  =  0pT(t) 

b2(t+l)  -  b2(t)  =  lb2  (t)  (G) 

hTM(t+1)  "  hTM(t)  =  qM(t)  +  blPf(t)  "  b2(t) 
hTp(t+l)  -  hTp(t)  =  q1  (t)  -  qM(t). 

The  interpretation  of  this  model  is  that  the 
transfer  price,  pT  < t ) ,  is  no  longer  controllable  by  a 
decentralized  decision  maker  but  has  been  established  by 
the  corporate  headquarters.  In  this  manner,  the  effect 
of  exogenous  transfer  price  changes  can  be  evaluated  with 
respect  to  their  impact  on  decentralized  decision  making. 
In  addition,  we  have  not  burdened  this  model,  (G) ,  with 
any  of  the  many  possible  constraints  that  could  enter 
into  an  internal  action  because  we  wish  to  minimize  any 
informational  requirements  inherent  in  the  model  and  allow 
as  much  flexibility  as  possible  for  decentralized  deci¬ 
sion  making.  The  decentralized  decision-making  model 
can  be  represented  as 


(H) 


x(t+l)  -  x  ( t )  =  A  x  (t)  +  E  B.u. 

i=l  1  1 


where 


In  this  model  the  decentralized  decision  making 
is  represented  by  u^t)  -  the  production  division  deci¬ 
sions,  and  u2(t)  -  the  marketing  division  decisions. 

Information  Structure 

The  decentralized  aspects  of  information  avail¬ 
ability  will  be  operationalized  by  defining  an  informa¬ 
tion  structure,  I j (t) ,  where  j  =  1,2  for  the  representa 
tive  decision  makers  and 
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Ij  (t)  =  {y-  <t)  ,  Y  { t—  1 )  ,  u(t-l)  } 

where  (t)  represents  the  current  information  available 
to  decision  maker  j  and  Y(t-l),  u(t-l)  represent  the  past 
history  of  the  organization;  that  is  Y  ( t — 1 )  =  (y(l),y(2), 
...,y(t-l)}  and  u(t-l)  =  { u (1) ,u (2) , . . . ,u (t-1) } .  This 
information  structure  is  referred  to  as  one-step  delay 
sharing  information  structure  in  the  information  and  con¬ 
trol  theory  literature  (Witsenhausen; 1971) .  Thus  each 
division  has  access  to  some  subset  of  the  current  state 
information  and  is  aware  of  prior  states  and  decisions. 
This  situation  closely  models  the  physical  environment 
where  a  decentralized  divisional  decision  maker  would  be 
aware  of  past  actions  taken  by  decision  makers  he  inter¬ 
acts  with  and  also  has  access  to  some  current  information 
concerning  the  state  of  the  organization.  This  current 
information  y^  (t)  can  be  expressed  as 

yj  (t)  =  Hj  (t)  x  ( t)  .  (I) 

The  information  matrix,  H ^ (t) ,  is  used  to  explicitly 
recognize  the  extent  to  which  information  is  decentral¬ 
ized.  In  the  prior  centralized  model  we  did  not  recognize 
an  observation  state,  y(t),  due  to  the  implicit  assumption 
that  all  information  needed  was  available.  A  similar 
situation,  i.e.,  perfect  information  completely  available, 
would  result  if  Hj (t)  is  set  equal  to  the  identity  matrix. 


This  results  in  a  situation  where  the  information  struc¬ 
ture  is  completely  centralized. 


Decentralized  Control 

Since  classical  stochastic  control  is  restricted 
to  a  single  decision  maker,  the  need  for  a  borader  theory 
to  address  decentralized  control  problems  is  apparent. 

The  current  state  of  decentralized  control  theory  is  in 
its  infancy  (Athans :  1978 )  . A  brief  summary  of  the 
evolution  of  decentralized  control  theory  was  discussed 
by  Basar  (1978)  as: 

The  first  decentralized  result  .  .  .  has  been 
obtained  by  Radner  (1962)  who  has  shown  among  other 
things  that  a  static  LQG  team  problem  admits  a  unique 
team-optimal  solution  linear  in  the  observation  of 
each  decision  maker.  This  result,  however,  is  to  be 
interpreted  with  caution  when  the  information  struc¬ 
ture  is  dynamic  and  nonclassical .  The  famous  counter¬ 
example  of  Witsenhausen  (1968)  is  indicative  of  this 
fact,  that  the  team-optimal  solution  of  a  dynamic 
2-member  team  problem  with  2-step  delay  information 
will  in  general  not  be  linear.  Ho  and  Chu  (1972) , 

Chu  (1972),  and  Chu  and  Ho  (1971)  have  studied  non- 
classical  but  nested  information  structures  and  have 
applied  within  that  context  Radner' s  above  cited 
result  to  dynamic  LQG  problems.  The  first  systematic 
formulation  of  decentralized  stochastic  team  problems 
within  a  general  framework  has  been  given  by  Wit¬ 
senhausen  (1971)  where  he  has  made  several  important 
assertions.  One  of  these  assertions  was  team- 
optimality  of  linear  solutions  in  the  optimization  of 
dynamic  LQG  team  problems  under  the  one-step  delay 
information  sharing  pattern.  This  assertion  was  then 
considered  almost  independently  by  Kurtaran  and  Sivan 
(1974),  Sandell  and  Athans  (1974),  and  Yoshikawa 
(1975)  where  the  authors  adopted  a  dynamic  programming 

■^See  this  issue  for  a  review  of  the  current 
state  of  decentralized  control  theory. 
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approach  to  derive  a  set  of  relations  for  the  linear 
solution  of  a  2-member  LQG  team  problem  to  satisfy. 

We  noted  that  the  decentralized  decision-making 
model  we  developed  involved  a  one-step  delayed  informa¬ 
tion  pattern.  The  information  structure  is  dynamic  in 
the  sense  that  current  decisions  are  influenced  by  deci¬ 
sions  made  by  other  decision  makers.  If  an  information 
structure  depends  only  on  the  state  observations  then  it 
is  referred  to  as  static  in  so  much  as  the  decisions  of 
one  member  are  not  affected  by  the  decisions  of  another 
decision  maker.  Yoshikawa  (1975)  has  shown  that  the 
one-step  delay  information  sharing  structure  constitutes 
a  dynamic  problem  that  can  be  decomposed  into  N  static 
team  problems  by  applying  the  technique  of  dynamic  pro¬ 
gramming. 

At  this  point  in  the  development  of  our  model  for 
decentralized  decision  making,  a  cursory  review  of  its 
evolution  reveals  that  we  now  have  a  decentralized  model 
for  decision  making  under  conditions  of  certainty.  How¬ 
ever,  the  concept  of  decentralization  implies  that  uncer¬ 
tainty  exists  with  repsect  to  the  information  availabil¬ 
ity  since  a  world  of  certainty  allows  centralized  decision 
making.  In  fact,  further  analysis  of  the  model,  as  it 
now  exists,  would  reveal  that  it  trivially  reduces  to  a 
situation  with  a  fully  centralized  information  structure. 


CHAPTER  IV 


DECENTRALIZED  DECISION  MAKING  UNDER 
CONDITIONS  OF  UNCERTAINTY 

Introduction 

In  this  chapter  we  extend  our  control  model  to 
allow  for  the  uncertainty  that  exists  in  a  decentralized 
decision-making  environment.  We  then  derive  an  optimal 
decentralized  control  policy  and  apply  the  results  to  our 
specific  model  to  provide  a  mechanism  to  discuss  informa¬ 
tion  availability  and  evaluate  transfer  pricing  policy. 

Model  Uncertainty 

Our  decentralized  model  of  the  firm  is  a  mathe¬ 
matical  model  of  a  physical  process.  The  model  is  an 
approximation  which  neglects  second-order  effects.  How¬ 
ever,  if  the  model  itself  was  exact  structurally,  the 
values  of  the  parameters  used  in  the  model  would  be  esti¬ 
mates  and  may  be  slightly  different  from  their  true 
values.  This  uncertainty  can  be  considered  in  our  model 
by  explicitly  considering  the  stochastic  nature  of  the 
firm.  We  incorporate  uncertainty  into  our  model  as  a 
zero-mean  disturbance  with  a  known  variance.  We 
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represent  model  uncertainty  by  the  stochastic  random 
variable,  w(t),  where 

Etw (t) }  =  0 
E{w(t)w’ (t) }  =  W(t) . 

Thus  w(t)  is  an  n-dimensional  vector  used  to  incorporate 
uncertainty  into  our  model.  For  example,  when  we  assumed 
the  following  structural  relationship, 

Pr(t+1)  =  (l+k)pr(t),  (4-1) 

we  did  not  admit  the  presence  of  any  system  uncertainty. 
In  actuality,  we  do  not  have  the  ability  to  discern  equa¬ 
tion  (4-1)  precisely.  We  would  be  more  accurate  in 
writing  a  relationship  of  the  form 

Pr(t+1)  =  (l+k)pr(t)  +wr(t).  (4-2) 

where  wf(t)  is  defined  above.  In  this  manner  we  allow 
for  model  uncertainties  that  may  exist  in  the  structural 
equations.  The  stochastic  model  of  the  decentralized 
firm  becomes  (from  eqn.  (H) )  and  the  inclusion  of  model 
uncertainty 

2 

x(t+l)  «  ( I+A) x (t )  +  E  B.u.(t)  +  w(t)  (4-3) 

i  =  l  1  X 


" - -  —  lil'iiB  fillilMftaiUftlil  11  UMiiliHrii  r 


where w(t)  represents  the  uncertainty  associated  with  the' 
dynamic  changes  of  the  state  of  the  firm. 

In  addition  to  the  model  uncertainty,  we  also 
recognize  the  lack  of  precision  inherent  in  the  avail¬ 
able  current  information.  We  realize  that  data  are  fre¬ 
quently  used  which  may  be  imperfect  due  to  cost  or  timing 
considerations.  The  decision  maker  must  frequently  base 
his  decisions  not  only  on  incomplete  information  (decen¬ 
tralized  concept)  but  also  on  imperfect  information  that 
is  currently  available.  This  uncertainty  can  be  incorpo¬ 
rated  into  the  observation  portion  of  our  model  in  a  man¬ 
ner  similar  to  the  acknowledgement  of  system  uncertainty. 
We  define  the  random  variable,  v(t) ,  where  E(v(t)}  =  0, 

E(v (t) v' (t) }  =  V (t) .  Then  equation  (I)  can  be  written  as 

yj (t)  =  H j (t) x ( t)  +  Vj(t).  (4-4) 

The  complete  stochastic  model  of  the  decentral¬ 
ized  firm  can  be  represented  by  equations  (4-3)  and  (4-4). 
The  state-space  representation  of  the  model  of  the  firm 
allows  us  to  explicitly  recognize  uncertainty  in  a  dynamic 
environment  not  only  with  respect  to  the  structural  model 
but  also  with  respect  to  the  information  currently  avail¬ 
able  to  the  decentralized  decision  makers. 
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Decentralized  Team  Performance  Criteria 
We  stated  earlier  that  a  major  premise  of  this 
study  is  that  an  organization  can  be  considered  as  a  team 
and  that  incentive  arrangements  can  be  made  to  induce 
team  behavior.  In  an  organization  consisting  of  many 
members  with  different  information  and  decision  possibili¬ 
ties,  it  is  possible  that  some  organizational  objectives 
may  not  be  consistent  with  the  individual  member's  objec¬ 
tives.  The  theory  of  teams  analyzes  organizational  deci¬ 
sion  making  where  different  members'  decisions  may  depend 
on  different  information,  but  a  common  goal  exists. 
Therefore,  in  standard  team  problems,  there  is  no  incen¬ 
tive  problem  since  there  is  no  conflict  of  interest. 
However,  Groves  (1973)  has  shown  that  there  exists  a  sys¬ 
tem  of  compensation  rules,  or  incentives,  that  will  induce 
members  of  an  organization  to  behave  as  a  team.  Groves 
notes  that  the  head  of  an  organization  has  some  latitude 
in  selecting  the  rules  for  compensating  his  managers  and 
it  is  desirable  for  him  to  select  rules,  if  they  exist, 
that  will  induce  his  managers  to  behave  as  if  they  were 
members  of  a  team.  He  terms  any  set  of  compensation 
rules  an  incentive  structure  and  views  the  role  of  the 
organization's  head  as  finding  an  optimal  incentive  struc¬ 
ture  that  will  induce  the  managers  to  behave  as  if  they 
formed  a  team.  Groves'  results  apply  to  what  he  terms  a 
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conglomerate  organization  which  consists  of  a  large  firm 
with  many  plants  independently  producing  and  marketing 
a  wide  variety  of  products.  The  plants  are  linked  only 
through  the  coordinating  decision  of  the  headquarters. 
Although  Groves'  definition  of  a  conglomerate  does  not 
seem  to  address  the  internal  transfer  environment  postu¬ 
lated  in  our  model,  we  note  that  the  derivation  in  the 
next  section  of  this  chapter  considers  the  dynamic  team 
problem  (internal  transfer)  as  a  series  of  static  team 
problems  (conglomerates) .  Therefore,  Groves'  analysis 
is  applicable  to  our  model  with  the  restrictions  we 
impose  in  the  derivation  of  the  optimal  decentralized 
control  model.  It  should  be  noted  that  our  model  excludes 
incentive  schemes  that  are  based  on  accounting  measures 
which  are  affected  by  the  choice  of  the  transfer  price. 

We  do  not  address  the  issue  of  conflicting  objectives 
which  is  the  central  problem  of  transfer  pricing  in  a 
nonteam  environment.  Thus  our  assumption  that  ince  tive 
arrangements  can  be  made  to  induce  team  behav’  -  .  ars 

to  be  reasonable  in  view  of  Groves'  work.  The  control 
technique  that  we  introduced  earlier  can  be  extended  to 
decentralized  decision  making.  For  the  special  case 
of  two  decision  makers,  eq.  (3-1)  can  be  restated  as 
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J  =  E(x(N+i)-x(N+l)  )  'Q (N+l )  (x (N+l ) -X  (N+l ) ) 

N 

+  I  ( (x(t)-x(t) ) 'Q(t) (x(t)-x(t) ) 
t=l 

+  (i^  (t)-u(t)  )  ,R1(t)  (u1(t)-u1(t)) 

+  (u2(t)-u2(t)) ’R2(t) (u2(t)-u2(t)))  (4-5) 

where  J  is  the  expected  cost  to  the  organization  result¬ 
ing  from  deviations  between  actual  performance  and  planned 
performance.  As  before,  individual  decision  makers  are 
punished/rewarded  according  to  deviations  from  targets 
for  which  they  are  held  accountable.  It  should  be  noted 
that  the  "punishment"  concept  can  be,  and  in  most  cases, 
would  be  transformed  to  a  reward  system  to  allow  for 
behavioral  implications.  Although  the  concept  of  reward 
versus  punishment  is  a  moot  point  from  an  analytic  per¬ 
spective,  the  ultimate  success  of  the  actual  control  tech¬ 
nique  could  be  affected  in  large  measure  by  the  manner  in 
which  it  is  presented  to  the  team  members. 

Derivation  of  Optimal  Team 
Decision  Policy  ~~ 

The  formulation  presented  here  is  based  on  a  com¬ 
bination  of  the  results  for  the  linear  quadratic  Gaussian 
problem  assuming  a  one-step,  delayed,  information  sharing 
pattern  and  Radner's  static  team  problem  with  quadratic 
cost  criterion.  The  theoretical  approach  closely  follows 
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that  taken  by  Speyer  and  Krainak  (1979) .  A  dynamic  pro¬ 
gramming  algorithm  is  applied  starting  at  the  terminal 
stage  and  proceeding  backwards  in  time.  At  each  stage 
the  cost-to-go  is  determined,  conditioned  on  the  past 
centralized  information  assumed  available  to  all  decision 
makers.  Minimizing  this  cost  to  go  is  essentially  a 
static  team  problem  where  each  decision  maker  is  to  make 
a  decision  based  upon  his  present  information  (which  only 
he  has)  and  the  past  information  shared  by  all  the  team 
members.  The  success  of  this  procedure  relies  heavily 
on  the  ability  to  reproduce  the  quadratic  cost  functional 
form  for  the  cost-to-go  at  each  stage. 

In  dynamic  programming  the  procedure  is  to  start 
at  the  terminal  stage  and  develop  a  recursion  relation¬ 
ship  operating  backwards  in  time.  The  cost  can  be  written 
as  a  sequence  of  nested  conditional  expectations  as 


—  »P  —  _  «p  _ 

J  =  E(E{(x1-x1)  Q1(x1~x1)  +  (u^u^  R1(u1-u1) 

+{E  ...+.. .E{ (xN+1  -*N+1)  Qn+1 


^XN+1_XN+1  ^^N+l  }/•  •  • 


(4-6) 


The  notation  E  {  ( -  )/I ^  }  denotes  the  expectation  operation 
conditioned  on  1^.  The  nejting  of  expectation  is  done 
with  respect  to  the  shared  information  pattern  and  does 
not  include  the  decentralized  portion.  In  this 


A 
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derivation  we  denote  the  time  period  as  a  subscript  for 
clarity  of  presentation. 

Define  the  cost-to-go  function  as  the  cost  to 
go  to  the  final  stage  from  stage  i  given  1^  denoted  as 

J(Ii)  =  E(E{ (xi~xi)TQi (xi-xi)  +  (ui-ui)TRi (uiui) 
+  E{ . . .  +  . . .E{ (xN+1  -  xn+1)TQn+1 


(XN+1-XN+1)/IN+1]/' ' * 


(4-7) 


where  it  is  assumed  that  an  admissible  control  policy 
sequence  has  occurred  up  to  stage  i-1.  A  recursion 
formula  for  for  all  ie[l,N]  is  obtained  directly 

from  (4-6)  and  (4-7)  as 


J(Ii)  =  E{ (xi-xi)TQi(xi-xi)  +  (ui-Gi)TRi 

(urV  + 


(4-8) 


This  recursion  formula  plays  a  central  role  in  the  develop¬ 
ment  of  the  one-step  delayed  information  sharing  pattern. 


Determination  of  Cost-to-Go 
at  Final  Stage 


From  (4-7)  the  cost-to-go  at  state  N+l  is 


J(IN+1)  E^XN+1”XN+1J  QN+1  (xN+l“XN+l^IN+l^ 


(4-9) 
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The  expectation  can  be  explicitly  determined  because  the 
conditional  probability  density  function  of  given 

IN  +  1  is  normal  with  mean  xN+iyN  and  covariance  PN+1//N'- 
i .  e  .  , 


p(xN  +  l//IN  +  l)  ~  N  (xN  +  l/N,PN+l/N)  (4-10) 

where  x^  +  jy^  is  the  conditional  mean  of  the  state  at  stage 
i+1  given  the  measurement  history  and  Pp+jy^  is  the  error 
covariance  in  estimating  the  state  at  i+1  based  on  the 
measurement  history  up  to  i.  From  Kalman  filtering 

Z'. 

theory,  the  conditional  mean  is  propagated  sequen¬ 

tially  by  the  update  formula 


xi+l/i 


A  .  x  .  , . 
1  l/i 


+  B  .u. 

l  l 


(4-11) 


where 


x . 


i/i 


=  x  .  , .  ,  +  k  .  v  • 
l/i-l  li 


and  the  zero  mean  white  noise  process  is  called  the 
innovations  process  and  is 


=  - 


yi 


(4-12) 


A 


A 

H.X 

1 


i/i-1 


where 
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is  the  estimate  of  the  measurement  y^.  The  variance  of 


VK  is 


=  HiP1/1_lHiT  +  V,  -  Ai 


The  Kalman  gain  is  defined  as 


-  Pi/i-l"!**!'1' 


The  error  variance  is  defined  as 


h/i-i  -  i 

*  Ai-lPi-l/i-lAi-l+  Wi-1 ‘ 


for  convenience,  define 


ei  -  xi  -  *i/i-i  :  E{ei>  *  °-  E(eiei  >  -  pi/i-i 


Ci  xi/i-l  Xi * 


Thus  (4-9)  becomes 

J(IN+1)  =  teN+l  +  CN+l)  QN+1  {SN+1+CN+1)  //IN+1^ 
E^eN+l  °N+leN+l  +  CN+1  °N+1CN+1 
+  2eN  +  l  QN+1CN+1//IN+1^ 

tr  QN+1PN+1/N  +  (XN+1/N_XN+1)  °N+1 
(XN+1/N  ”  XN+1} 
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1 


tr  QN+1PN+1/N  +  XN+1/N  QN+1^N+1/N 

2xN+1  °N+1XN+1/N  +  XN+1  QN+1XN+1 
„  T 

J(IN+1)  "  KN+1  +  XN+1/N  QN+1xn+1/N  ~  2ZN+1XN+1/N 

(4-13) 


where : 


kn+i  tr  °N+1PN+1/N  +  XN+1  QN+1XN+1 


ZN+1  XN+1  °N+1 


Determination  of  Cost-to-Go  From 
Stage  N  to  Stage  N+l 

By  using  (4-8) ,  the  problem  of  finding  the  mini¬ 
mum  value  of  J,  J*,  is  obtained  recursively  by  using  the 
fundamental  theorem  in  Meier,  et  al.  (1971)  which  states 
that 


J*  =  minimum  E{ J ( I  ) }  =  minimum  E{min  J(I  )} 
u.Vi£[l,N]  N  uiVie[l,N-l]  uN 


(4-14) 


Note  that  J(IN)  has  been  used  for  convenience  to  denote 
J(IN,uJ(IN) )  j=l,...,k)  where  u^(I^)  for  j=l,...,k  is  now 
to  be  determined  by  the  minimization  in  (4-14)  as 

J*(IN>  =  min  Et  (Xn-5n)Tqn(xn-Jn)  +  (“n-5nITRn<“n-;1n) 

N  1  N; 


(4-15) 
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J*<V  *  n>inE{(eN+cNrQN(eN+cNl  +  <>W  ^  “VUN> 

UN (IN' 

+  KN+1  +  XN+1  °N+1XN+1/N 

2zn+ixn+i/n//in  * 


J*(IN)  min  E{tr  QnPN-1  +  XN/N-1  QNXN/N-1 

Un(in} 

2xxTQNXN/N-l  +  XN  °NXN  +  UN 

+  GnTrnGn-2unTrnGn  +  kn+i 

+  (ANXN/N-1  +  BNUN  +  AkkNVN>  QN+1 

(ANXN/N-1  +  BNUN  +  ANkNVN> 

2ZN  +  1  (ANXN/N  +  1  +  BNUN  +  ANkNVN)/IN'> 

J*(XN)  =  min  eJkn+kn+1+unTRnun+xn/n_i  (Qn+AN  QN+1AN) 

W 

XN/N-1  +  UN  (BN  QN+1BN+RN)UN 

2  (zn+zn+ian)xn/n-i 
-  V<Vn  +  BNTzn+1T> 

+  kN  N  QN+lANkNVN  +  2UN  BN  QN+1ANXN/N-1 

+  2  UN  BN  QNflANkNVN  +  2VN  kN  ^  ^^N+l^^/N-l 
T  T1  T*  T 

2  kN  ^  ZN  +  1 


(4-16) 
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define 


then  (4- 

J*  ( I 


£n  kn+kn+i+un  rnun 

2n  -  vWiN 

=  bn  qn+ibn  +  ’Sj 

-N  =_ fZN+ZN+lAN^ 

2k  '  -<Vn  +  bnX+1t> 

-N  =  kN  *N  QN+lANkN 
=  BN  '^N  +  lSj 
=  BN  QN+lANkN 

it,  ’  “nXX.i'S) 

5*4  lkN  *N  ZN+1  5 

16)  can  be  written  as 

N>  •  Minimum  E ( +  ’Vn-iX'Vn-I 

UN  UNJ 

+  UN  ^nUN  +  2-NXN/N-l 
+  2UN  ^*4  +  VN  ^4VN  +  2UN  ^NXN/N-1 
+  2uN  ^4VN  +  2VN  Vn/N+1  +  2VN  ^N/:IN}* 


(4-17) 


Since  is  measureable  with  respect  to  the 

o-algebra  generated  by  1^,  the  random  vector  over  which 

the  expectation  conditioned  on  1^  is  taken,  is  Note 

that  since  depends  on  I N 1 ,  Uj^1  depends  explicitly  on 

the  local  information  v.,1'  The  determination  of 

N  N  N 

is  a  static  decentralized  team  problem.  Before  proceeding 
to  the  static  team  problem,  we  define  4>  as 

^ N  =  UN  5nUN  +  2UN  +  2UN  ^NXN/N-1 

+  2UN  ^NVN  +  VN  ^NVN  +  2vn  ^NXN/N-1 


+  2vnV 


(4-18) 


Using  (4-18) ,  we  rewrite  (4-17)  as 


JMV 


+  XN/N-1  ^NXN/N-1  +  2^NXN/N+1 

+  min  eH  /i  ). 

N  N 

UN 


(4-19) 


The  static  team  problem  is  to  minimize  J(I„) 

N 

with  respect  to  the  control  function  u„^(I.,^)  where  I  ^ 

N  N  N 

is  defined  as  the  union  of  I,,  and  v  .  The  decision 

N  N 

function  can  be  written  as 


V'V’  -  '  *N/N" 


(4-20) 


Radner  showed  that  for  a  quadratic  cost  criterion. 


this  functional  form  is  linear  and  can  be  expressed  as 
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n  3(1  ^  )  =  D^y^+C^(x  ) 

N  1  N  1  N  N  N  1  N/N+1J 


(4-21) 


which  satisfies  a  coupled  set  of  stationary  conditions. 

To  develop  the  stationary  conditions,  suppose  that 
the  decision  functions  of  all  but  one  of  the  team  members 
are  fixed.  Then,  a  one-person  minimization  is  performed 
by  assuming  that  the  fixed  decision  functions  of  the 
other  team  members  are  at  their  one-person  minimum  denoted 

/n  “i  i 

as  uNJ  (INJ) .  The  one-person  cost  criterion  is 

uNJ^Nj+1(I-i+1) . 0(JMk))  J  (4-22) 


N 


N  '  N 


under  proper  conditions  (Radner : 1962 ) ,  the  operations  of 
expectation  and  differentiation  can  be  exchanged  to  give 


E%j%ViN’)-  e  (3I^  j1"  ’>  -  o; 
3un  3un 

j  =  l , . . .  ,  k 


(4-23) 


Equation  (4-23)  results  in  the  following  K  stationary 
conditions 


E{T~_D  tuN  ^NUN  +  2uN  -N  +  2UN  ^NXN/N-1 
3  N 


+  2uN  51nVN//IN^  ^  “  0;  j-l,...,K 


(4-24) 
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Thus,  the  first-order  conditions  become 

E(ji  <RNjV’  +  5n’Vn-1  +  V 

+  vN/INj  }  =  0;  j  =  l , . . . , K  (4-25) 

where 

(  k,-5  denotes  the  jth  row  of  (  ).. 
note  that 

M  j  _  *  M  jl  i  ,,, 

~N  VN  -  ^  (4~26) 

Rewriting  (4-25)  and  (4-26)  and  considering  (4-21) 
results  in 

E(  [  <VV  ♦  +  J.  W 

1=1 

+  ^N^N/N-l  +  =  0;  (4-27) 

to  evaluate  expectations,  recall 

p(vij)  ~  N(0,Ai)  ~  N(0,HIjPi/i_1HijT  +  Vij) 


note  that 
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E{ v  .  Xv  .  ^T}  =  e{v1v:’T}  =  E{  (y.  1-H.  1X.  ..  ,) 
1  1  Ji  l  l/i-l 


E{„.iv/T) 


(xr4t/i-i’  VW  JT 


.  »i1(x1-^.l»H«  *  vtVT> 


K  3Ti 


E  t  V  . 1  v  .  }  =  H.XP.  ..  .  H. 

ll  l  l/i-l  l 


(4-28) 


therefore 


f.(v1v3)  ,  N(0,L) 


(4-29) 


where 


l11  iLij 
_  _ 

Ljl  Il« 


HiPHlT+Vi 


HjPHlT 


h^ph^t 

H^PH^+V^ 


and  the  conditional  density  is  (Jaswinski,  p. 45:1970); 

p(v1/v^)  -  N(Llj  (Llj)”1v:j,  L^1-L1^  (L^  )  ~1L^) 

(4-30) 

Using  (4-30)  we  define  the  following  conditional  means 


Eiv  =  v  3 

N  '  N  N 


(4-31) 


I 
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E{vN1'V  -  v„3 

=  hn  pn/n-ihn3  (hn3pn/n-ihn3  +  vn’  un^ 

(4-32) 

Combining  (4-27),  (4-31),  and  (4-32)  and  taking  the 
expectation  explicitly  yields 


‘VV  *  »N33i"Nj  +  <VV  *  «h31» 

m 


H  1P  H  jTA  j"Xv  j 

“n  pn/n-ihn  an  N 


+  +  S"J* 


^N^N/N-l  +  ^t)3  *  °!  3-1 . K 

(4-33) 


since  is  arbitrary 


S^V  *  Bn”  *  1£1  <VV  *  «Njl> 

l^j 


HN1WlHNj’V'1  *  °!  5'1 . *  ,4-341 


and 


1=1^^  +  ^N^XN/N-1  +  ^  “  0;  <4“35) 


rewriting  (4-35) ,  since  >  0 


-1 


CN  ”^N  [^NXN/N-1  + 
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(4-36) 


thus , 


UN  DNVN  +  CN 


(4-37) 


where  u„  is  the  decision  vector  for  the  team  as  a  whole 
N 

at  time  N  and  DN  is  the  block  diagonal  matrix  formed  from 
the  individual  decision  maker's  innovations  process  gains 
where  DN  is  determined  as  the  unique  solution  to  (4-34) 
and  CN  is  given  by  (4-36) .  Inserting  (4-37)  into  (4-19) 
yields 


J*(V  ~  -N  +  XN/N-1  2nXN/N-1  +  2^NXN/N-1 

+  E{  (VN  -  VVXN/N-1  -  BN"1dN)T 

5n(DNVN_  §^N/N-1 

+  2(DNVN  "  -NXN/N-1  " 

+  2(DNVN  "  -NXN/N-1  '  ^N1  ^NXN/N-1 

+  2(dnvn  -  VWi  - 

+  VN  ^NVN  +  2VN  -NXN/N-1 


+  2vn  W 


(4-38) 
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J*(IN)  =  -N  +  XN/N-1  &N*N/N-1  +  2-NXN/N-l 

+  E^N  °N  °NVN  +  XN/N-1  -N  -N^/N-l 

+  -N  +  — NXN/N-1 

-2-N  -N  ^NXN/N-1  "  2— N  -N 


T„  T_  -1 


T„  -1, 


"  2xN/N-l  ^NXN/N-1  2^N  ^N^/N-l 


+  2vN  DN  +  VN  — NVN//IN^ 


(4-39) 


J*(IN)  “  +  XN/N-1  ^NXN/N-1  +  2^NXN/N-1 

t  /r^  T„  -1_  +  2MmTDkt  +  N-,)Am 

tr  (dn  5n  dn  n  -N  N 


-  x 


T„  T„  -1 


T_  -1 


N/N 


-1  5nXN/N-1  '  ^ 


2-n  *N  5mXN/N-1 


(4-40) 


define : 


=  — N  +  tr  {D*\~\  +  2%i\  +  — 


AN  " 


— N  ”  — N  -N 


2N  “ 


J 
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thus  we  rewrite  (4-40)  as 

J*(IN)  Kn  +  xNyN_1  Qn^n/n-i  "  2-^n*N/N-1  (4-41) 


Note  that  JMI^)  given  by  (4-41)  is  functionally  similar 
to  given  by  (4-13). 

Determination  of  Cost-to-Go  From 
Stage  N-l  to  Final  Stage 


=min  ,T  QN-1(xN-1"*N-1) 

UN-1UN-1} 


+  <UN-l"^N-l)  T  RN-l(uN-l'UN-l)+J(IN)/IN-l} 

(4-42) 


-  t-  ~  - 

J*(IN-l)  =  KN-l+XN-l/N-2  °N-lXN-l/N-2  +  2ZN-lXN-l/N-2 


+  min  E{uN_1  +  2un_1  dN_x 

V-l 


+  VN-1  ^N-1VN-1  +  2UN-1  BN-l*N-l/N-2 


+  2uN-1  **N-1VN-1  +  VN-1  ^N-l*N-l/N-2 


+  2VN-1  aN-l/i:N-l} 


(4-43) 


where 

*N-1  =  ^-1  +  *N  +  “n-1  ^-l^N-l 
®N-1  =  QN-1  +  ^Sj-l  ^N^N-l 


*N-1  =  ^-1  +  BN-1  ®NBN-1 
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5N-1  [rn-iun-i  +  BN-1  ZN  1 

**N-1  =  kN-l  \-l  ®NAN-lkN-l 
BN-1  =  BN-1  ^VSj-i 
^N-l  =  BN-1  ^NAN-lkN-l 
*N-1  =  ^-1*11-1  Vn-1 

Vi -  -iviViVi 

Note  that  (4-43)  is  of  the  same  functional  form  as  (4-17) 
which  indicates  that  Radner's  static  team  problem  must 
be  solved  again  for  J*(IN_^)  which  results  in 


UN-1  °N-1VN-1  +  CN-1 


(4-44) 


where  DN_^  is  determined  by 


(4-46) 


thus  i)  becomes 

N-  I 

J*(IN-l)  =  V-l  +  XN-l/N-2  ^N-lXN-l/N-2 

_2^N-lXN-l/N-2  (4-47) 

where 

*N-1  =  *N-1  +  tr  (dn-i  Vi  dn-i 
+  2RN-1DN-1+  Vl'Vl 
V-l  V-l  V-l 


°N-1  °N-1  "  V-lV-1  SN-1 

V-l  =  V-l  V-l  ^N-l  "  V-l* 


Since  J*(IN_^)  given  by  (4-47)  is  functionally 
similar  to  J*(IN)  and  J(IN+^)r  the  general  recursion 
relationships  in  going  from  stage  i+1  to  stage  i  can  be 
stated  by  appealing  to  an  induction  argument.  That  is, 
the  results  in  going  from  N+l  to  N  hold  in  going  from 
state  i+1  to  i  if  i  replaces  N.  By  induction,  the  optimal 
decentralized  control  policy  using  the  one-step  delayed 
information  sharing  pattern  at  stage  i  is 
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UK(  VK,XK(K-1)  °KUK  "  <RK  +  BK  ^K+l^1 

lBK  QK+1AKXK/K-1  “  RKUK  “  BK  ZK+1  ^  (4-48) 

where  D„  is  determined  by: 

A 

(RK  *  BKT0K+lBK)j3DK3  +  <BKT«K*lAKkK)33 


and 


+  J‘i(rk+bkT°k.iV3V  +  (bkT°k+iVk)31' 

1?£j  .T 

HKlpK/K-lHKj  AKj"1}  =  0»j=1»**"K  (4-49) 


UK  QK  +  AK  QK+1AK  ~  (BK  ^K+1AK5 


<rk  +  bk\+ibk)'1(bkTOk+xak)  <4-50) 


qn+i  qn+i 


(4-51) 


ZK  ZK  +  ZK+1AK  "  (RK^K  +  BKZK+1  } 


(rk  +  bk  qk+ibk)  (bk  ^k+iak) 


(4-52) 


ZN+1  ZN+1  XN+1  °N  +  1 


(4-53) 


Equations  (4-48)  -  (4-53)  represent  the  generalized  opti¬ 
mal  control  policy  for  a  mean  deviation  quadratic  penalty 
function.  These  control  theory  results  extend  prior 


results  in  the  literature  for  a  two-person  team  to  encom¬ 
pass  a  general  n-person  team  setting.  In  addition,  the 
performance  measure  allows  the  optimal  control  law  to  be 
a  function  of  predetermined  state  and  control  objectives, 
or  targets.  It  should  be  noted  that  the  unpublished  paper 
by  Speyer  and  Krainak  (1979)  has  solved  a  K-person  linear 
exponential-Gaussian  control  problem  where  the  controls 
are  penalized  over  time  and  the  final  state  is  penalized. 

Our  general  results  can  now  be  applied  to  the  two- 
member  team  model  of  this  dissertation.  However,  we  note 
that  the  control  theory  results  represented  by  eqns. 

(4-43)  -  (4-48)  can  be  applied  to  problems  involving  both 
time-varying  parameters  in  the  system  and  observation  model 
in  addition  to  considering  time  varrying  error  variance  in 
both  the  state  and  the  observations. 

Application  of  General  Results  to 
Two-Person  Team 

For  a  two-person  team  equations  (4-48)  -  (4-53) 
can  be  expressed  as 

UK  =  °KVK  +  EKXK/K-1  +  FK  (4-54) 


where 
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and  can  be  determined  as  the  solution  to  the  following 
set  of  coupled  equations: 

(BK1T0K+lBKl  +  V’  DK1(HKlpK/K-lHK1T  +  V1 

1T~  2  2  2  IT 

B  qk+ibk  dk  hk  pk/k-ihk 


bk  ^k+iakpk/k-ihk 


(4-55) 


bk2TQk+ibk1dk1hk1pk/k-ihk2T  +  [bk2TQk+ibk2+rk21 

dk  [hk  pk/k-ihk  +vk  1  =  ~bk  ®k+iakpk/k-ihk 

(4-56) 


in  addition 


E  = 
K 


ekX 

rk+bk  ®k+ibk 

B  1T0  B  2 

bk  qk+ibk 

_ek2_ 

B  2Tn  B  1 

_K  QK+1BK 

r  2+b  2t5  b  2 

K  +BK  QK+1BK_ 

-1 


bk1T«k+iak 


bk  ^k+iak 


(4-57) 


and 
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K 


1 

„  1  „  1T~  „  1 

„  1T~  „  2 

F 

rK 

= 

rk  +bk  qk+ibk 

bk  qk+ibk 

F  2 
_  R  _ 

VWlV 

-1 


rk  15k1+bk1T*k+iT 

’'kW'W-J 


(4-58) 


The  application  of  equations  (4-54)  through 
(4-58)  to  the  decentralized  model  of  the  firm  developed 
in  this  dissertation  results  in  further  simplifications 
due  to  our  assumption  of  time-invariant  parameters  and 
statistics. 


Optimal  Control  Policy  for  Decentralized 
Model  of  the  Firm 

The  optimal  decision  for  the  Production  Division 
can  be  expressed  as 

ul(t)  =  DL(t)v(t)  +  E^tjxft/t-l)  +  Fx(t)  (4-59) 

u^t)  =  D1(t)y1(t)  +  [E-^t)  -  D1(t)H1(t)] 

x(t/t-l)  +  F^t)  (4-60) 

Similarly,  the  optimal  decision  policy  for  the  Marketing 
Division  can  be  expressed  as 

u2(t)  =  D2(t)v(t)  +  E2(t)x(t/t-l)  +  Fx(t)  (4-61) 


iHH  mi  mi  litiriiii 
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u2(t)  =  D2(t)y2(t)  +  [E2(t)  -  D2(t)H2(t)] 


x(t/t-l)  +  F2 (t) 


(4-62) 


where  D^(t)  and  D2(t)  can  be  found  as  the  unique  solu¬ 
tion  to  the  following  set  of  coupled  equations: 

[B1TQ(t+l)B1  +R1]D1(t)  [H1P  (t/t-l)H^+V^] 

+  B1TQ(t-H)B2D2  (t)H2P(t/t-l)H1T 


=  -B1TQ (t+1) AP (t/t-l)H1T 


[B2TQ(t+l)B1D1(t)H1P(t/t-l)H2T] 


+  [B2  Q(t+l)B2+R2)D2(t) 
[H2P(t/t+l)H2T+V2) 


=  -B2TQ(t+l)AP(t/t-l)H2T 


(4-63) 


further 


Ex(t) 


E2(t) 


(4-64) 


R1+B1TQ(t+l)B1  B1TQ(t+l)B2 


R2+B2  Q(t+1)B2 


B2iQ(t+l)B1 


B^Qlt+DA 

B2TQ(t+l)A 


(4-65) 
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and 


Fx  (tl 

R1+B1TQ(t+l)B1 

B1TQ(t+l)B2 

B2TQ(t+l)B1 

R2+B2TQ(t+l)B2 

Rl“l  (t>+B1T~Z(t+l>T 
R2u2 (t)+B2TZ (t+l)T 


-1 


(4-66) 


The  theoretical  results  (eqns.  (4-60)  and  (4-62)) 
provide  us  with  an  intuitively  appealing  decision  process. 
We  note  that  the  optimal  decisions  are  a  weighted  average 
of  the  current  private  information  available  to  the 
decision  maker  plus  information  based  on  past  actions 
and  the  current  targets  (or  objectives)  that  have  been 
established.  Therefore,  the  decision  maker  considers  all 
three  of  these  sources  to  arrive  at  an  optimal  decision. 
The  relative  importance  of  private  information  versus 
historical  information  versus  current  objectives  is  deter¬ 
mined  by  the  system  parameters,  the  observation  parameters 
and  the  performance  function  weighting  parameters. 

Further  analysis  of  the  optimal  decision  policy 
characteristics  is  limited  due  to  the  theoretical  nature 
of  this  dissertation.  However,  certain  structural  con¬ 
straints  currently  imposed  on  the  model  do  not  appear  to 
be  atypical  of  a  large  number  of  actual  decentralized 


organizations.  Additional  restrictions  are  necessary  to 
evaluate  the  effect  of  a  change  in  transfer  pricing 
policy.  Thus,  further  conclusions  reached  in  this  dis¬ 
sertation  must  be  limited  to  the  current  model  as  have 
developed  and  extensions  to  other  models  should  be  per¬ 
formed  with  caution. 

Incentive  Arrangement 

Incentive  arrangements  are  incorporated  into  our 
model  through  the  performance  measure  function  weighting 
matrices,  Q,  R^  and  R2 .  The  specific  relationship  is 
recalled  as 


J  =  E  (x(N+l)-x(N+l))'Q(N+l)  (x (N+l)  -x  (N+l ) ) 

N 

+  E  ((x(t)-x(t)) 'Q(t) (x(t)-x(t)) 
t=l 

+  (u1(t)-u1(t) ) ,R1(u1(t)-u1(t) ) 

+  (u2(t)-G2(t) ) ’R2(u2(t)-u2(t) ) )  (4-67) 

where  the  objective  of  the  team  is  to  minimize  the 
expected  "cost"  function,  J.  We  observe  that  each  vari¬ 
able  can  be  isolated  by  appropriate  definitions  of  the 
weighting  matrices.  For  example,  we  observe  that  the 
weighting  matrix  Q  can  be  used  to  affect  the  degree  to 
which  the  decision  makers  attempt  to  achieve  predetermined 


values  of  the  state  variables  where  we  defined  the  state 


X1 

Pr 

Raw  material  price 

X2 

PT 

Transfer  price 

X3 

= 

b2 

= 

Demand  parameter 

X4 

h„w 

TM 

Marketing  inventory 

_X5_ 

hTP 

Production  inventory 

It  was  noted  earlier  that  the  decision  makers  have 
no  direct  control  over  the  raw  material  price,  the  trans¬ 
fer  price  or  the  demand  parameter  and  we  would  not  expect 
to  observe  incentive  arrangements  regarding  these  vari¬ 
ables.  For  the  current  model  we  would  expect  an  incentive 
arrangement  to  result  in  a  weighting  matrix  represented 
as 


0  0 
0  0 


Q  = 


0  0 


0  0 


0 


0 


0  0  0 

0  0  0 

0  0  0 

o  Q1  0 

0  0  Q2 


(4-69) 


where  and  Q2  represent  the  incentives  with  regard  to 
desired  levels  of  inventory/backlog  for  the  marketing 
and  production  divisions  respectively.  A  similar 
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observation  with  respect  to  the  control  variables  u^ 
and  u2  which  we  defined  as 


u 


1 


u 


2 


production  decision 

internal  transfer  decision 
pricing  decision 


(4-70) 


leads  us  to  anticipate  the  following  structural  forms: 


R 


1 


'll 


(4-71) 


R2  - 


r21  0 


R 


22 


(4-72) 


where  ,  Q2 ,  R.^,  and  R22  represent  the  various  incen¬ 
tive  arrangements.  It  is  reasonable  to  expect  that  the 
state  and  control  variables  may  not  be  considered  equally 
important  to  the  firm  as  a  whole.  In  addition,  differ¬ 
ences  in  unit  measurements  would  indicate  that  a  per¬ 
centage  deviation  scheme  might  be  more  desirable  than  an 
absolute  deviation  philosophy.  Unfortunately,  the  theo¬ 
retical  nature  of  this  dissertation  does  not  lend  itself 
to  detailed  analysis  with  regard  to  alternative  incentive 
arrangements  although  this  particular  aspect  of  the  con¬ 
trol  model  would  consistute  a  significant  portion  of  an 


J 
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empirical  application  of  the  model.  To  facilitate  fur¬ 
ther  analysis  in  this  study,  we  "weight"  the  target 
variables  equally  based  on  absolute  deviation;  that  is, 
<2^=02  =  =  R2 ^  =  R22  =  We  note  that  this  assump¬ 

tion  is  not  essential  to  the  following  analysis  but  is 
made  merely  as  a  mathematical  convenience. 

Information  Structure 

For  illustrative  purposes  we  assume  the  decen¬ 
tralized  information  structure  is  such  that  the  production 
division  receives  current  information  concerning  the  raw 
material  price,  the  transfer  price  and  its  own  inventory/ 
backlog  status.  Similarly,  the  marketing  division 
receives  current  information  concerning  the  transfer 
price  and  its  own  inventory/backlog  status.  This  assump¬ 
tion  posits  a  situation  that  would  represent  an  expected 
lower  bound  on  current  information  availability;  i.e., 
we  would  expect  the  production  decision  maker  to  have 
access  to  current  raw  material  prices,  his  current  inven¬ 
tory  position  and  the  current  transfer  price.  We  would 
also  envision  the  marketing  decision  maker  to  have  access 
to  his  current  inventory  position  and  the  current  transfer 
price  as  a  minimum.  This  information  structure  is  cap¬ 
tured  in  the  model  by  defining  the  following  information 


matrices : 
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1  0  0  0  0 


H 


1 


0 

0 


1 

0 


0 

0 


0  0 
0  1 


H 


2 


0 

0 


1 

0 


0 

0 


0  0 
1  0 


(4-73) 


We  note  that  the  production  decision  maker  has  current 
information  concerning  raw  material  prices  and  production 
inventory  that  is  not  available  to  the  marketing  division, 
while  the  marketing  division  has  current  information  con¬ 
cerning  their  inventory  position  that  is  not  available  to 
the  production  decision  maker.  Further,  both  decision 
makers  are  aware  of  the  current  existing  transfer  price. 

Transfer  Pricing  Policy  Analysis 
To  facilitate  analysis  with  respect  to  the  impact 
of  changes  in  the  transfer  pricing  policy  on  our  decen¬ 
tralized  control  model  of  the  firm,  we  have  posited  an 
incentive  arrangement  and  a  specific  information  struc¬ 
ture.  We  make  one  additional  assumption  with  regard  to 
the  stochastic  nature  of  the  problem  and  assume  that  the 
stochastic  parameters  are  zero-mean  and  unity  variance 
random  variables.  This  assumption  does  not  affect  the 
generality  of  the  analysis  and  is  made  for  mathematical 


■  1  ...  .  -  ■  WkuAt.  f  «.  .—i 
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convenience.  We  restate  eqns.  (4-60)  and  (4-62)  for 
reference  as 

ux(t)  =  D1(t)y1(t)  +  (E1(t)-D1(t)H1(t))x(t/t-l) 

+  F1(t)  (4-74) 

u2(t)  =  D2(t)y2(t)  +  (E2(t)-D2(t)H2(t) )x(t/t-l) 

+  F2(t)  (4-75) 

and  note  that  F^(t)  and  F2(t)  are  not  dependent  on  the 
transfer  price.  We  can  determine  the  effect  of  transfer 
pricing  changes  on  the  optimal  decentralized  decisions 
by  an  analysis  of 

D1(t)y1(t)  +  (E1(t)  -  D^tjH^t)  )x(t/t-l)  (4-76) 

to  determine  the  impact  of  a  change  in  the  transfer  price 
on  the  production  decision.  Similarly,  we  will  be  able 
to  evaluate  the  impact  of  a  transfer  pricing  change  on 
the  pricing  decision  and  the  internal  exchange  decision 
by  examining 

d2 (t)y2 (t)  +  (E2 (t)-D2 (t)H2 (t) )x(t/t-l) .  (4-77) 

We  observe  that  the  only  time-varying  matrix 
involved  is  Q(t) ,  whose  solution  is  given  by  the  discrete 
Riccati  eqns.  (4-50)  and  (4-51).  To  determine  the  impact 
of  a  change  in  transfer  price  on  the  optimal 
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decentralized  decision  policy  we  compute  u(N)  where 
Q  (N  +  l)  =  Q (N+l ) . 

To  determine  (N)  and  D2(N)  we  compute  the  fol¬ 
lowing  matrices: 


BjQ(N+l)B1  +  R1  =  2 


H1P(N/N-1)H^  +  V1  = 


2  0  0 
0  2  0 
0  0  2 


BjQ(N+l)B2  =  [-1  0J 


H2P(N/N-1)H^  = 


0  10 
0  0  0 


BjQ(N+l) AP(N/N-1)H^  =  [0  0  1] 


B^Q(N+1)B1  = 


-1 

0 


H^fN/N-llH^  = 


0  0 
1  0 
0  0 


B^Q(N+1)B2  +  r2  = 


(1+bJ) 


'•mm, 
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H2P(N/N-1)H^  +  V2 


2 

0 


0 

2 


B^Q(N+1) AP (N/N-l ) H2 


0  0 
0  b. 


Substituting  these  values  into  eqs.  (4-63)  and 
(4-64)  and  solving  the  resulting  set  of  coupled  linear 
equations  simultaneously  results  in  the  explicit  solu¬ 
tion  of  D (N)  as  follows: 


Dl(N) 

D2(N) 


(0  0  -1/4] 

0  1/6 
0  -b1/(2b^  +3) 


(4-78) 

(4-79) 


Further  algebra  and  redefinition  of  constant 
terms  results  in  the  following: 


D^N)  =  , 

[0  0 

K. 

0 

(4-80) 

0  K_ 

D2(N)  = 

(4-81) 

! 

0  K_ 
L_  3_ 

E1(N)-D1(N)H1  = 

= 

[0 

0 

K4 

K5  k6] 

(4-82) 

0 

Q 

k7 

00 

vo 

1 

E„ (N) -D- (N) H~  = 

: 

7 

£  t 

4. 

0 

0 

K10 

K11  K12 

(4-83) 

liiill'Wiii'il  ni'frj.m  iTirffnviiii 
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where  K^;  i=l,2,...,12  are  known  constants. 

Combining  eqns.  (4-76)  and  (4-80)  -  (4-83)  pro¬ 
vides  the  information  needed  to  explicitly  determine  the 
impact  of  the  transfer  price  on  the  production  decision  as 

UL  (N)  u(0  0  KjJ  {pr  pT  hTp]T 

+  [0  0  K4  K5  K6]  [pr  pT  b2  hTM  hTp]T.  (4-84) 

We  note  that  the  optimal  production  decision  is 
not  dependent  upon  the  actual  transfer  price  in  our 
model.  Similarly,  the  marketing  division's  decision 
process  is  partially  represented  by  rewriting  eqn. 

(4-77)  as 


[Pr  P>r  b2  hTM  hTP^  ’ 


(4-85) 


Again  we  observe  that  both  the  optimal  pricing 
decision  and  the  optimal  internal  product  transfer  deci¬ 
sion  are  independent  of  the  transfer  price.  Since  the 
functional  form  of  the  optimal  decision  rule  is  repeated 
for  all  periods  the  analysis  regarding  the  transfer 
pricing  for  period  N  can  be  extended  for  all  decision 
periods.  The  implication  of  the  above  analysis  is  that, 
under  the  assumptions  inherent  in  the  research  model, 


the  establishment  of  transfer  pricing  policy  does  not 
affect  the  optimal  decentralized  team  decision  making 
policy.  The  transfer  pricing  decision  was  removed  from 
the  control  of  the  decentralized  decision  maker  and  the 
transfer  price  was  assumed  to  be  an  exogenous  variable 
which  is  generated  by  a  first-order  Markov  process.  Thus 
under  the  restrictive  conditions  of  the  model  (i.e.,  a 
team  setting  coupled  with  an  exogenous  transfer  price) , 
we  observe  that  neither  decision  maker  uses  information 
concerning  the  transfer  price  and  thus  the  determination 
of  the  transfer  pricing  policy  is  not  important  with 
respect  to  his  optimal  actions.  Recall  that  the  indi¬ 
vidual  decision  maker  has  knowledge  of  the  impact  of  his 
decisions  on  the  firm  as  a  whole.  In  this  setting,  the 
decision  maker  would  realize  that  the  transfer  price  is 
an  internal  mechanism  for  the  firm  and  as  such  it  has  no 
impact  on  the  overall  objectives  of  the  firm  in  a  team 
setting.  This  result  holds  for  both  one-period  and  multi 
period  analyses  since  the  dynamic  results  we  have 
generated  can  be  easily  reduced  to  a  single-period  analy¬ 
sis  by  discarding  the  time  argument.  This  result  can 
be  anticipated  by  the  realization  that  the  decision 
makers  are  attempting  to  achieve  corporate  objectives 
which  are  derived  from  an  overall  perspective  based  on 
external  market  conditions.  The  internal  transfer  price 


is  merely  a  means  for  liquidity  transference  and  should 
not  affect  the  performance  of  the  organization  as  a 
whole;  that  is,  the  optimal  decentralized  team  decision 
making  policy  is  independent  of  transfer  pricing  policy 
and  the  determination  of  an  "optimal"  transfer  pricing 
policy  should  not  be  based  on  its  effect  on  the  optimal 
decentralized  decision  maker's  actions. 


CHAPTER  V 


SUMMARY  AND  DIRECTIONS  FOR  FUTURE  RESEARCH 

Summary 

This  dissertation  has  addressed  the  unique 
aspects  involved  in  controlling  a  decentralized  organiza¬ 
tion.  We  have  integrated  the  concepts  of  modern  control 
theory  with  the  concepts  of  team  theory  to  develop  an 
optimal  control  policy.  This  policy  was  then  applied  to 
the  conceptual  framework  developed  for  the  analysis  of 
decentralized  decision  making. 

The  main  finding  of  this  study  is  that  the  trans¬ 
fer  price  involved  in  the  interdivisional  exchange  of 
goods  or  services  does  not  affect  the  decentralized  deci¬ 
sion  maker's  actions.  In  a  team  setting  we  showed  that 
the  optimal  decentralized  decision-making  policy  is  not 
dependent  on  the  transfer  price.  If  the  operation  of  a 
transfer  pricing  system  is  costly  to  the  organization  as 
a  whole,  our  research  has  shown  that,  for  optimal  decision 
making  in  a  team  setting,  transfer  pricing  may  be  an 
ineffective  decision-making  tool.  In  this  setting  we 
would  not  expect  to  see  a  transfer  pricing  system  used 
for  decentralized  decision  making.  If  a  team  setting 
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exists  in  a  decentralized  organization,  through  some 
incentive  system,  the  expenses  involved  with  a  transfer 
pricing  system  may  be  avoided  and  a  resultant  increase 
in  efficiency  could  be  realized.  Although  this  result 
does  not  provide  a  procedure  for  determining  the  "optimal" 
transfer  pricing  policy,  it  does  indicate  that  transfer 
pricing  policy  decisions  should  not  be  based  on  their 
impact  on  optimal  decentralized  decision  maker's  actions. 
However,  this  result  was  conditioned  on  two  restrictive 
assumptions  in  the  dissertation  development.  The  first 
of  these  involved  treating  the  organization  as  if  it  is 
a  team.  If  this  assumption  is  discarded  and  the  transfer 
price  becomes  a  mechanism  to  improve  the  position  of  one 
division  at  the  expense  of  another,  then  the  results  of 
this  dissertation  would  not  be  applicable. 

The  second  assumption  considered  the  transfer 
price  itself  as  an  exogenous  variable  generated  as  a 
first-order  Markov  process.  The  effect  of  removing  the 
transfer  pricing  decision  from  the  control  of  the  decen¬ 
tralized  producer  of  the  transferred  product  was  not 
investigated  in  this  research  study.  It  is  not  readily 
apparent  what  impact  this  assumption  would  make  on 
the  dissertation  findings. 


Directions  for  Future  Research 

The  focus  throughout  this  dissertation  has  been  a 
theoretical  one.  However,  the  framework  developed  pro¬ 
vides  a  means  to  evaluate  the  model  empirically.  Future 
research  could  branch  in  at  least  two  complimentary  direc¬ 
tions.  First,  the  framework  itself  can  be  refined  by 
extending  it  to  address  parameter  uncertainty.  Second, 
empirical  research  could  be  performed  in  an  econometric 
sense  to  determine  how  the  model's  decisions  compare  with 
actual  decisions  made  by  decentralized  managers.  Further 
analysis  that  considers  the  transfer  price  as  a  decision 
variable  under  the  control  of  a  decentralized  decision 
maker  needs  to  be  performed.  The  results  of  this  analysis 
would  provide  insight  to  the  robustness  of  the  findings 
of  this  dissertation. 

An  interesting  empirical  question  lies  unanswered 
concerning  the  main  premise  of  this  dissertation;  i.e., 
an  organiztion  can  be  considered  as  a  team  which  implies 
that  manager/organizational  goal  conflicts  are  considered 
as  higher  order  effects.  In  general,  there  appears  to  be 
a  wide  field  open  for  research  into  the  application  of 
modern  control  theory  to  practical  affairs  in  real  organi¬ 


zations  . 
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APPENDIX  A 

STATE-SPACE  REPRESENTATION  OF  A  DYNAMIC  SYSTEM 


APPENDIX  A 


STATE-SPACE  REPRESENTATION  OF 
A  DYNAMIC  SYSTEM12 

The  state-space  description  of  a  system  has  been 
used  exclusively  in  modern  control  and  systems  theory. 

Its  use  derived  from  the  motivation  to  represent  any 
physical  system  by  a  number  of  first-order  differential/ 
difference  equations  that  relate  an  equal  number  of  vari¬ 
ables.  If  at  any  given  time,  the  numerical  values  of 
these  variables  are  known,  the  state  of  the  system  is  com¬ 
pletely  specified,  and  if  future  inputs  to  the  system  are 
also  known,  the  state  of  the  system  at  any  future  time  is 
also  specified.  Using  the  state-space  approach  an  n  order 
difference  equation  can  be  described  as  n  first  order  dif¬ 
ference  equations  which  can  be  written  compactly  as 

x(t+l)  =  f (x (t) ,u(t) ,t)  (A-l) 

where  x(t)  is  an  n  dimensional  state  vector,  f  is  an  n 
dimensional  vector-valued  function  and  u(t)  is  an  r  dimen¬ 
sional  control  vector  (or  decision  vector) . 

12 

The  material  in  this  appendix  is  based  on 
Pindyck  (1973) . 
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The  advantage  of  describing  a  system  by  (A-l) 
is  that  it  gives  us  a  set  of  variables  that  completely 
determine  the  state  and  future  behavior  of  the  system. 

If  at  any  time,  t=0,  the  state  x(t=0)and  all  present  and 
future  values  of  the  control  u(t)  are  known,  then  we 
can  completely  determine  the  state  of  the  system  x(t)  for 
any  future  time,  t.  The  time  path  of  the  state  vector  is 
called  the  state  trajectory.  For  a  given  system  this 
will  be  determined  by  the  initial  state  and  the  time  path 
of  the  control  vector  (the  control  trajectory) . 


A 


APPENDIX  B 

DERIVATION  OF  OPTIMAL  CENTRALIZED  PLANNING  POLICY 
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APPENDIX  B 


DERIVATION  OF  OPTIMAL  CENTRAL  PLANNING  POLICY 


Problem 


Minimize : 


N-i  m  rp 

J  =  (1/2)  I  (X1(t)QX(t)  +  U  (t)RU(t) 


+  XT(t)SU(T)  +  C']  +  (1/2) XT (N) QX (N) 


(B-l) 


Subject  to: 


Dynamics 


X(t+1)  -  X(t)  =  AX  (t )  +  BU  ( t) 


Constraint 


CX(t)  +  DU  (t)  =  0 


Boundary  Conditions 


X  (t=0)  =  X  (0 ) 


tf  =  N. 


L(x(t)  ,u(t)  ,  t)  =  ( 1/2)  XT  (t)  QX  (t)  +  (l/2)UT(t)RU(t) 


+  ( 1/2) X  ( t) SU ( t)  +  (1/2)C 


K  (X (N) )  =  (1/2 ) XA (N)QX(N) 
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APPENDIX  B 


DERIVATION  OF  OPTIMAL  CENTRAL  PLANNING  POLICY 

Problem 

Minimize : 

N-i  rn  T 

J  =  (1/2)  £  (X  (t)QX(t)  +  lT(t)RU(t) 

t=0 

+  XT(t)SU(T)  +  C']  +  (1/2)  XT  (N)QX  (N)  (B-l) 

Subject  to: 

Dynamics 

X(t+1)  -  X(t)  =  AX  (t )  +  BU  ( t) 

Constraint 

CX(t)  +  DU  (t )  =  0 

Boundary  Conditions 
X (t=0)  =  X(0) 

tf  =  N. 

let: 

L(x(t)  ,u(t) ,t)  =  (1/2) XT (t) QX  (t)  +  (1/2) UT (t) RU (t) 

+  (l/2)XT(t)SU(t)  +  (1/2)C 
K  (X (N) )  =  ( 1/2 ) XT (N) QX (N) 
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Ill 


f  (x(t)  ,u(t)  ,t)  =  AX  +  BU 
p ( x ( t ) ,u(t) ,t)  =  CX  +  DU 

from  eq.  (2-8) 

3H/3U |  «  0  =  RU * ( t )  +  (l/2)STX*(t)  +  BTp*(t+l) 

* 

-  DTu*(t+l) 

thus 

u*(t)  =  -R_1[(l/2)STX*(t)  +  BTp*(t+l) 
from  eq.  (2-9) 

x*(t+l  -  x * ( t )  =  AX* ( t )  +  BU* ( t) 

from  eq.  (2-10) 

p*(t+l)  -  p*(t)  =  -  QX* ( t)  -  (1/2 ) SU* ( t)  -  ATp* (t+1) 

+  CTu*(t+l)  (B-4 ) 

from  eq.  (2-11) 

CX*(t)  +  DU* ( t )  =  0  (B-5) 

from  eq.  (2-14) 

p*(N)  =  QX* (N)  (B-6) 

NOTE  (omit  *  in  remainder  of  derivation) 
from  (B-5) 

DU ( t )  =  -CX(t) 


-  DTu* (t+1) ] 

(B-2) 

(B-3) 
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premultiply  both  sides  of  ( B—  2 ) 

DU(t)  =  -DR*1 [ (1/2) STX(t)  +  BTp(t+l)  -  DTu(t+l)] 

thus 

-CX(t) 

CX(t) 

rewriting 

DR*1DTu(t+l)  =  [ (1/2)DR*1ST  -  C)X(t)  +  DR*1BTp(t+l) 

NOTE  (DR  1dT  =  scalar,  thus  (DR  1DT)  1  exists) 
therefore 

u  ( t+ 1 )  >  (DR"1DT)“1[f(l/2)DR*1ST  -  C]X(t) 

+  DR*1BTp(t+l) ] 

for  convenience,  define 

F  =  (DR*1DT)*1:  a  scalar. 

thus 

u(t+l)  =  [ (1/2)FDR*1ST  -  FC] X ( t)  +  (FDR*1BT]p(t+l) 

(B-7 ) 

substituting  (B-7)  into  (B-2) 

u ( t )  =  -R_1[(l/2)STX(t)  +  BTp(t+l)  -  (l/2)FDTDR-1STX(t) 
+  FDTCX(t)  -  FDTDR*1BTp(t+l)  ] 

u ( t )  =  -R*1 [ [ ( 1/2 ) ST  -  (l/2)FDTDR*1ST]X(t) 

+  [BT  -  FDTDR"1BT]p(t+l) ]  (B-8) 


-  -DR_1[(l/2)STX(t)  +  BTp(t+l)  -  DTu ( t+1) ] 

=  (1/2) DR-1STX(t)  +  DR*1BTp(t+l)  -  DR_1DTu(t+l) 
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substituting 

P(t+1)  - 


rewriting 

p(t+l)  - 


substituting 
x(t+l)  - 

x(t+l)  - 


substituting 
p(t+l)  - 


(B-7)  into  (B-4) 

p(t)  =  -QX(t)  -  (1/2) SU (t)  -  ATp(t+l) 

+  [ (1/2) FCTDR*1ST  -  FCTC]X(t) 

T  -IT 

+  FC  DR  B  p (t+1 ) 

p(t)  =  [ (i/2)fctdr~1st  fc  c  Qlx(t) 

+  (FCTDR“1BT  -  AT]p(t+l) 

+  FCTDR~1BTp(t+l)  (B-9) 

(B-8)  into  (B-3) 

X ( t )  =  AX ( t)  -  BR-1[[(1/2)ST  -  (1/2)FDTDR_1ST 
+  FDTC]X(t)  +  [BT  -  FDTDR-1BT]p(t+l) ] 

X ( t )  =  [A  -  <1/2)BR_1ST  +  (1/2)FBR-1DTDR-1ST 

-IT  -1T-1T 

-  FBR  D  C]X(t)  +  [FBR  D  DR  B 

-  BR“1BT]p(t+l)  (B-10) 

(B-8)  into  (B-9) 

p(t)  =  [ (1/2)FCTDR"1ST  -  FCTC  -  Q] X ( t) 

T  -1  T  T 
+  [FC  DR  B  -  A  ]p(t+l) 

+  (1/2) SR"1 [ [ (1/2) ST  -  (1/2)FDTDR~1ST 

+  FDTC]X(t)  +  [BT  -  FDTDR_1BT]p(t+l) ] 
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p(t+l)  -  p(t)  =  ( {1/2)FCTDR_1ST  -  FCTC  -  Q 

-IT  -IT  -IT 

+  ( 1/4 ) SR  XS  -  ( 1/4 ) FSR  D  DR  S 

+  (l/2)FSR_1DTC]X(t) 

T  —  IT  T  —  IT 

+  [FC  DR  B  -  A  +  ( 1/2 ) SR  ~B 
-  (l/2)FSR_1DTDR~1BT]p(t+l)  (B-ll) 

Thus  we  have  two  vector-matrix  difference  equa¬ 
tions  (B-10)  and  (B-ll)  subject  to  the  split  boundary  con¬ 
ditions:  X ( t=Q)  =  X (0)  and  p(N)  =  QX(N).  Such  problems 
are  called  two  point  boundary  value  problems  and,  in 
general,  they  are  difficult  to  solve.  Notice  that  equa¬ 
tions  (B-10)  and  (B-ll)  are  coupled  since  X(t+1)  depends 
on  p(t+l)  and  p(t+l)  depends  on  X(t).  We  assume  that 
these  variables  are  linearly  related  (this  is  known  as 
the  sweep  method  for  solving  a  linear  two  point  boundary 
problem) . 

Thus 

p(t)  =  K ( t )  X(t)  (B-12 ) 

and  we  will  see  later  that  this  will  result  in  a  unique 
solution. 

Substituting  (B-12)  into  (B-8) 

U ( t )  =  -R"1{[(1/2)ST  -  (1/2)FDTDR“1ST  +  FDTC}X(t) 

+  [BT  -  fdtdr'1bt] [K(t+l)X(t+l) ]] 

j 


+  fFR"1DTDR"1BT  -  R_1BT]K(t+l)X(t+l)  (B-13) 

substituting  (B-12)  into  (B-10) 

X(t+1)  -  X(t)  =  [A  -  (1/2)BR"1ST  +  (1/2)FBR-1DTDR-1ST 

-  FBR_1DTC]X(t)  +  [FBR_1DTDR“1B,r 

-  BR~1BT] K (T+l) X ( t+1)  (B-14) 

rewriting 

[I  +  BR_1BTK{t+l)  -  FBR_1DTDR-1BTK(t+l)]X{t+l) 

=  [I  +  A  -  (1/2)BR~1ST  +  (1/2)FBR“1DTDR“1ST 

-  FBR“1DTC]X(t)  (B-15) 

substituting  (B-12)  into  (B-ll) 

(RH  side) 

p(t+l)  -  p(t)  =  [ (1/2)FCTDR-1ST  -  FCTC-Q+(1/4)SR_1ST 

-  (1/4) FSR"1DTDR"1ST 
+  {1/2)FSR"1DT  C]X(t) 

+  [FCTDR"1BT  -  AT  +  (1/2)SR_1BT 

-  ( 1/2 ) FSR-1DTDR-1BT] K ( t+l) X (t+l) 

(LH  side) 

T  —1  T  T  —IT 

II  -  FC  DR  B  +  A  -  ( 1/2 ) FSR  B 

+  (l/2)FSR"1DTDR_1BT]K(t+l)X(t+l)  -  K(t)X(t) 
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=  ( (1/2) FCTDR'1ST  -  FCTC  -  Q  +  (1/4)SR~1ST 
-  (1/4)FSR_1DTDR~1ST  +  (l/2)FSR_1DTC]X(t)  (B-16) 

let 

E  =  [I  +  BR-1BTK(t+l)  -  FBR“1DTDR~'LBTK(t+l)  ] 

thus 

X(t+1)  =  E_1  [ I  +  A  -  <1/2)BR~1ST  +  {1/2)FBR_1DTDR"1ST 
-  FBR-1DTC]X(t)  (B-17) 

substituting  (B-17)  into  (B-16) 

T  -1  T  T  —1  T 

[I  -  FC  DR  B  +  A1  -  ( 1/2 ) SR  XBX 

+  (l/2)FSR~1DTDR“1BT]K(t+l)E'1 [I  +  A  -  (1/2)BR-1ST 
+  (l/2)FBR_1DTDR"1DTC]X(t) 

=  [K ( t )  +  (1/2)  FCTDR-1ST  -  FCT  -  Q  +  (1/4)SR~1ST 

-  (1/4)FSR-1DTDR-1ST  +  (l/2)FSR-1DTC]X(t) 

equating  coefficients 

[I  -  FC  DR  B  +  A  -  ( 1/2 ) SR  XB 

+  (l/2)FSR~1DTDR'1BT]K(t+l)E":L[I  +  A  -  (1/2)BR_1ST 
+  (1/2) fbr-1dtdr-1st  -  FBR_1DTC^ 

=  K(t)  +  ( 1/2 ) FCTDR~1s'r  -  FCTC  -  Q  +  (1/4)SR”1ST 

-  (1/4)FSR“1DTDR“1ST  +  (1/2)FSR_1DTC 
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let 

H  =  [I  +  A  -  (1/2)BR_1ST  +  (l/2)FBR-1DTDR';LS,r 
-1  T 

-  FBR  DC] 

let 

L  =  [- (1/2)FSR*1DTC  +  (1/4)FSR"1DTDR-1ST 

-  (1/4)SR_1ST 

+  Q  +  FCTC  -  (1/2)FCTDR~1ST] 


thus 

K ( t )  =  HTK(t+l)E_1H  +  L  (B-18) 

from  (B-6)  and  (B-12)  we  have 
p (N)  =  QX(N)  =  K (N) X (N) 

thus 

K(N)  =  Q  (B-19) 

We  can  solve  eq.  (B-18)  backward  to  find  K(t) , 
t  =  1 , . . . ,N. 

Substituting  (B-17)  into  (B-13)  yields 

U ( t) =  [ (1/2)FR'1DTDR"1ST  -  (1/2)R~1ST  -  FR'1DTC] X ( t) 

+  [FR_1DTDR-1BT  -  R”1BT] K(t+l)E_1HX(t) 
for  convenience,  define 

J  =  [ (1/2)FR"1DTDR_1ST  -  (1/2)R"1ST  -  FR_1DTC] 

-1  T  —1  T  —1  T 
M  =  [FR  D  DR  BA  -  R  BAJ 

: 

1 

; 

1 
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thus 

U*(t)  =  JX<t)  +  MK{t+l)E_1HX(t)  (B-20) 

recall 

E  =  [I  +  BR_1BTK(t+l)  -  FBR_1DTDR~1BTK(t+l) ] 

using  the  matrix  identity: 

T  —  1  T  —  1  T 

(I  +  ST  )  =  I  -  S(I  +  T  S)  T 

'  n  r  r 

where  S  is  (nxr) ,  T  is  (nxr)  and  r<n 

E  =  [I  +  B(R-1BT  -  FR”1DTDR“1BT)K(t+l) ] 

let 

S  =  B  and  TT  =  (R-1BT  -  FR_1DTDR"1BT) K(t+1) 

then 

-1  —IT  -1  T  — 1  T  —1 

E  =  I  -  B [ I  +  (R  B  -  FR  D  DR  Bx)K(t+l)B] 

-1  T  —1  T  -IT 

(R  B  -  FR  D  DR  B  )K(t+l) 

define 

T  T  -1  T 

V  =  [B  -  FD  DR  B  ] 

thus 

E_1  =  I  -  B ( I  +  R_1VK(t+l) B) "1R_1VK(t+l) 

E_1  =  1  -  B [R (I  +  R_1VK(t+l)B) ]-1VK(t+l) 

E_1  =  I  -  BIR  +  VK ( t+1) B] _1VK ( t+1)  (B-21) 


1 
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note  that  we  now  only  need  to  invert  an  (rxr)  matrix 
instead  of  an  (nxn)  matrix 

substituting  (B-21)  into  (B-18) 

K ( t)  =  HTK{t+l)[I  -  B [ R  +  VK(t+l)B]~1VK(t+l) ]H  +  L 

(B-22) 

substituting  (B-21)  into  (B-20) 

U* (t)  =  JX(t)  +  MK ( t+1 ) 

(I  -  B [ R  +  VK ( t+1 ) B] _1VK (t+1) HX ( t )  (B-23) 

Equation  (B-23)  determines  the  optimal  control 
in  terms  of  the  present  state  and  solutions  of  the 
"Ricatti"  equation  (B-22) .  Once  the  system  has  been 
defined  (matrices  A  and  B) ,  the  additional  constraints 
identified  (matrices  C  and  D)  and  the  performance  measure 
determined  (matrices  Q,  R,  and  S)  the  optimal  decision 
can  be  found  as  follows: 

1.  Solve  the  Riccati  equation  (B-22)  with  boun¬ 
dary  condition  (B-19)  backward  in  time  to  get  K(t)  for 

t  =  1,...,N-1.  Store  the  resulting  (nxn)  matrices  (N  of 
them) . 

2.  Compute  the  optimal  control  U*(0)  from  equa¬ 
tion  (B-23)  using  the  initial  conditions  X(t=0)  *  X(0) 
and  the  matrix  K(l). 

3.  Compute  the  next  state  using  equation  (B-3) . 
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4.  Compute  the  next  optimal  control  using  equa¬ 
tion  ( B— 2  3 )  . 

5.  Iterate  steps  (3)  and  (4)  until  all  U*(t), 
t  =  0,...,N— 1  and  all  X*(t)  t  =  1,...,N  have  been  com¬ 
puted. 

Application  of  the  above  algorithm  to  the  model 
developed  in  Chapter  II  results  in  the  following 
simplifications  (since  V  and  M  are  null  matrices  in  the 
model)  : 

K  ( t )  =  HTK(t+l)H  +  L  (B-22a) 

U*(t)  =  JX*(t)  (B-23a) 

where  H,  L,  and  J  are  defined  above. 
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OPTIMAL  PLANNING  POLICY  FOR  CENTRALIZED  FIRM 


Period  One 

RAW  MATERIAL  PRICE  =  20750.000 
DEMAND  PARAMETER  (B2)  =  5000.0000 
INVENTORY  =  0.0000000 

PRODUCTION  QUANTITY  DECISION  =  1448.0198 
PRODUCT  PRICING  DECISION  =  35519.802 


Period  Two 

RAW  MATERIAL  PRICE  =  21372.500 
DEMAND  PARAMETER  (B2)  =  5250.0000 
INVENTORY  =  -2 . 91038305E-11 
PRODUCTION  QUANTITY  DECISION  =  1540.9653 
PRODUCT  PRICING  DECISION  =  37090.347 


Period  Three 

RAW  MATERIAL  PRICE  =  22013.675 
DEMAND  PARAMETER  (B2)  =  5512.5000 
INVENTORY  =  -5 . 82076609E-11 
PRODUCTION  QUANTITY  DECISION  =  1639.1745 
PRODUCT  PRICING  DECISION  =  38733.255 


Period  Four 

RAW  MATERIAL  PRICE  =  22674.085 
DEMAND  PARAMETER  (B2)  =  5788.1250 
INVENTORY  =  -  2. 9 10 3 8 30 5E- 11 
PRODUCTION  QUANTITY  DECISION  =  1742.9289 
PRODUCT  PRICING  DECISION  =  40451.961 


Period  Five 

RAW  MATERIAL  PRICE  =  23354.308 
DEMAND  PARAMETER  (B2)  =  6077.5313 
INVENTORY  =  - 2 . 9 10 38 30 5E-11 
PRODUCTION  QUANTITY  DECISION  =  1852.5250 
PRODUCT  PRICING  DECISION  =  42250.063 
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